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SEQUENTIAL PATTERNS AND MAXIMIZING}! 


CAMERON R. PETERSON? axp Z. J. ULEHLA? 


University of Colorado 
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because they expect sequ 
ment was to determine w 
tenability of sequential 
maximizing responses. 

the random generation o. 
dependencies were obje 
presented prearranged seque 
pate sequential patterns. 
pothesis; experimental con 
did control conditions 


In typical probability-learning ex- 

eriments, S can maximize the ex- 
ected number of correct predictions 
py always predicting the most fre- 
quently occurring event; thus each 
rediction of the most frequent event 

can be termed a maximizing response. 
aximize on 


Most Ss do not learn to m 
every trial. Several hypotheses have 


been advanced as explanations of this 
suboptimal behavior. A major hy- 
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ential patterns. 
hether or not the elimination of the objective 
dependencies would increase the proportion of 
21 Ss in the experimental condition controlled 
f events by the throw of a die so that sequential 
ctively unreasonable. i 
nces, making it reasonable for S to antici- 
Results confirmed the experimental hy- 
ditions led to more maximizing responses than 
at the .01 level of significance. 


rning experiments do not maximize, perhaps 


The purpose of this experi- 


21 control Ss were 


pothesis is that Ss respond not only 
to the unconditional event probabili- 
ties, but also respond to hypothesized 
sequential dependencies or patterns 
in the sequence of events (e.g., see 
Restle, 1961, pp. 107-113). — 
Three lines of evidence support the 
hypothesis that many Ss do search 
for sequential patterns: (a) Ss say 
that they are looking for patterns 
when protocols are obtained via the 
“thinking aloud" procedure (Feld- 
man, 1962); (b) Ss show recency 
effects (Jarvik, 1951) indicating that 
they respond to previous sequences di 
outcomes; and (c) Ss are capable of 
finding patterns when they do exist in 
the sequence of outcomes 
ig (Anderson, 
If hypothesized sequential patterns 
constitute one important reason for 


2 


lack of maximizing, then elimination 
of the objective basis for sequential 
patterns should increase maximizing. 
The purpose of this experiment was 
to compare maximizing in two condi- 
tions: (a) in the experimental condi- 
tion S controlled the random genera- 
tion of events so that he knew that no 
patterns were arranged by E; (b) in 
the control condition each sequence 
which emerged in the experimental 
condition was presented to a new S 
(here the sequences were determined 
in advance of presentation to S, mak- 
ing it objectively reasonable to antici- 


pate sequential patterns). The ex- 
perimental hypothesis was that the 
proportion of maximizing responses 


would be greater in th 


d € experimental 
condition, 


METHOD 


CAMERON R. PETERSON AND Z. J. ULEHLA 


of sequential dependencies than the apparatus 
for the control group. The essential difference 
was that the correct outcome (“black" or 
“white’’) on each trial was determined (a) in 
the experimental group by which side of a die 
rolled by S turned up and (b) in the control 
group by which color was on the back of acard 
turned up by E, the cards being stacked in a 
fixed sequence prior to the experiment. 

Subjects and payment.—kForty-two Uni- 
versity of Colorado students Served as Ss, 
Twenty-one .Ss participated in the experi- 
mental condition and 21 in the control 
condition. 

The Ss were paid 2 cents for each Correct 
prediction and fined 2 cents for cach incorrect 
prediction. Maximizing on every trial would 
yield an expected earning for the experiment 
of $2.00, more than any other behavior. For 
example, matching the event probabilities 
would yield an expected earning of $.67. 
The Ss were obtained from the student em- 
ployment office in order to enhance the utility 
of the earnings. 

Apparatus.—The apparatus for the ex- 
perimental conditions consisted of a die and 
adice cup. Two sides of the die were covered 
with white tape and four sides were covered 
with black tape. The apparatus for the 
control condition consisted of a deck of 4 X 54 
in. opaque cards and a desk-calendar-type 
spindle which permitted the sequential ob- 
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Fic. 1. 


Mean proportion of maximizing resp 
experimental group and for Ss in 


BLOCKS 


onses as a function of trial blocks for Ss in the 


the control group. 
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servation of the back of each card. A small 
square of either black tape or white was 
attached to the center of the back of each 
card. Four black and two white squares were 
displayed on the spindle as a constant re- 
minder to S that there were approximately 
twice as many black as white squares in the 
deck. 

Procedure.—An experimental trial con- 
sisted of requiring each S to: (a) predict which 
color would turn up upon a roll of the die, 
(b) roll the die, and (c) receive the 2-cent 
reward or penalty. A control trial consisted 
of requiring each S to: (a) predict which color 
would be on the back of the next card in the 
deck, (b) observe the correct color, and 
(c) receive the 2-cent reward or penalty. 

The actual sequence of colors for each 
control S was determined as follows. One 
control S was paired at random with each 
experimental S. The sequence of colors in 
each control S's deck was prearranged to cor- 
respond with the sequence of colors actually 
rolled by the experimental S with whom he 


was paired. Each experimental and control 
S matched by sequence was considered as a 
matched pair where appropriate for the data 
analysis. 


RESULTS AND CONCLUSION 


Figure 1 presents the mean propor- 
tions of maximizing (black) predic- 
tions for the experimental and control 
groups during each of the 12 blocks 
of 25 trials each. The experimental 
group maintained approximately a .1 
advantage throughout all blocks of 
trials. Since learning to maximize in 
both conditions seems to reach its 
peak after 200 trials, the last 100 trials 
were evaluated in terms of a £ test, 
the results being ¢ (20) = 3.48, p < .01. 

Figure 2 presents the number of Ss 
who maximized completely and the 


5 EXPERIMENTAL 


Subjects 


of 


CONTROL 


Number 
a 


| 


complete 


maximizing 


.595 .645 .695 45 


Proportion of 


Fic. 2. 


795 


MAXIMIZING 


.895 945 .995 


RESPONSES 


Proportion of the last 100 trials on which individual 
experimental and control 5s maximized. 
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number of Ss who fell into each .05 
interval short of maximizing. More 
experimental than control Ss exceed 
any arbitrary criterion of tendency to 
maximize between .65 and .99. 
Clearly the experimental condition 
results in more maximizing than does 
the control condition. The experi- 
ment was designed to vary objective 
tenability of sequential dependencies. 
To the extent that this may be con- 
sidered the essential difference be- 
tween the experimental and control 
conditions, the experimental hypoth- 
esis is supported—elimination of an 
objective basis for sequential depend- 
encies leads to more Maximizing. 
This conclusion, in turn, supports the 
more general hypothesis that Ss’ an- 
ticipation of sequential dependencies 


is one of the factors leading to non- 
maximizing responses in probability- 
learning experiments. 
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NUMBER AND TYPE OF AVAILABLE INSTANCES 


Veterans Administration Hospital, Oklahoma City 


IN CONCEPT LEARNING ! 


VLADIMIR PISHKIN AND AARON WOLFGANG 


‘This study explored the effects of availability to S of 2 types of stimulus 
instances, correctly and incorrectly sorted, on performance in a 4-choice 
concept-learning task. Errors to problem solution were reduced by the 
availability of 1 instance in each of the 4 response categories. Addi- 
tional instances beyond 1 had no appreciable effect. Performance when 
only incorrectly sorted instances were exposed was unaffected by 
availability. "There was no significant interaction between number and 


University of Oklahoma School of Medicine 


type of available instances. 


The function of S’s memory in con- 
cept learning has been discussed 
recently by Hunt (1962, p. 143), who 
pointed out that, in Restle's (1960, 
1961) model, concept learning may be 
assumed to occur without memory. 
The "no memory" model emphasizes 
hypothesis testing by .S from a uni- 
verse of potential hypotheses, where 
sampling of these is characterized by 
periodic selection and rejection of in- 
correct hypotheses. In this model the 
hypotheses simply continue to be re- 
placed until S arrives at a correct one, 
although S may remember whether 
his working hypothesis has been 
correct for some of the preceding trials 
(Trabasso & Bower, 1964). 

Somewhat contrary to the tenets of 
the "no memory” model is recent 
evidence that availability of instances 
from past trials facilitates concept 
performance (e.g. Cahill & Hovland, 
1960). : 

Typically, stimuli to be categorized 
in concept learning are presented. in 
succession and previous information 
must be retained in memory. _ Hov- 
land and Weiss (1953) and Cahill and 
Hovland (1960) assessed the effect of 

1 Supported by Veterans Administration 
Medical Research Program (8,200 Funds). 
The valuable suggestions of Lyle E. Bourne, 
Jr. are gratefully acknowledged. 


EZ 


memory for previously presented in- 
formation by presenting the stimuli 
either successively, with no past in- 
formation available, or simultaneously, 
with all past information exposed. 
The simultaneous procedure yielded 
more rapid learning than the succes- 
sive presentation method. Ina more 
recent experiment, Bourne, Goldstein, 
and Link (1964) investigated the 
number of available instances as a 
variable in concept learning. Groups 
which had up to five stimuli exposed 
tended to make fewer presolution 
errors than groups with less; there was 
a tendency for availability to produce 
a performance decrement with more 
than five stimuli exposed. The above 
studies demonstrate clearly that Ss 
forget previously presented informa- 
tion and that .S has at least two tasks 
in a concept problem ; first, remember- 
ing information and secondly, on the 
basis of memory and of currently 
available stimuli, determining  E's 
classification rule. 

Earlier concept-learning studies left 
available to .S instances which had 
been categorized both correctly and 
incorrectly. The present study was 
designed to explore the functional 
relationships between concept learn- 
ing and (a) amount (five levels) of 
available information from past trials 
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and (b) availability of three types of 
stimulus instances which were sorted 
by Ss, viz., those sorted correctly, 
incorrectly, or a mixture of both cor- 
rectly and incorrectly sorted instances. 


METHOD 


Subjects.—The Ss were 90 male and female 
volunteer students from classes in elementary 
psychology at the University of Oklahoma. 
They were randomly divided into 15 treat- 
ment groups of 6 Ss each. 

Design.—A 3 X5 factorial design was 
used. It included three conditions of type of 
available instances, viz, (PF) correctly 
sorted only, (NF) incorrectly sorted only, and 

(PNF) both types. Positive, negative, or 
combination of positive and negative feedback 
was given verbally by E in PF, NF, and PNF 
conditions, respectively. In addition, there 
were five conditions of number of available 
instances, i.e., zero, one, two, three, or four of 
the previously presented stimulus cards in 
each of the four categories were in S's view 
on any trial. These conditions are referred to 
as n-0, n-1, n-2, n-3, and n-4, respectively. 
s ions began by responding 
to a single stimulus, In availability condi- 
tions greater than n-0 this card remained in 
view while the next was presented. Once the 
specified level of availability was reached the 
earliest of all exposed stimuli was removed 
upon the presentation of each new one. Note 
that in Availability Cond. n-0 all levels of 
"type" of instances were treated identically, 
since no stimuli were available from previous 
trials. 
Task and apparatus.—The S's task was to 
categorize a series of geometric patterns in 
accordance with one relevant dimension. The 
relevant dimension for all Ss was the form of 
the pattern. The test material was the 
Wisconsin Card Sorting Test, a pack of 64 
geometric patterns, each containing one, two, 
three, or four identical figures (triangles, 
Squares, crosses, and circles) in one of 
four colors (red, green, yellow, or blue). 
A gray wooden sorting tray, with four double 
compartments, was used. The upper half of 
the compartment was used for stimuli to 
which .$ made the wrong responses and the 
lower half for stimuli which .S placed cor- 
rectly. Each compartment was labeled A; 
B; C, or D; 

Procedure.—At the start of each session .S 
sat at a desk directly opposite E. The 
Wisconsin Card Sorting pack and the sorting 


tray were in front of .S. 
for the PF condition follow: 


The instructions 


Listen carefully to the instructions. When 
I say "Begin," turn over the top card. 
Tell me where you think each of the cards 
should go, in Slot A, B, C, or D. I will tell 
you when you are right, and if you arc 
right, lay the card face up in the slot you 
chose in the front row. The times you are 
not right, I will say nothing and you can 
lay the cards face down in the back row. 
Only one card (two, three, or four, depend- 
ing on the condition) should be exposed at 
any one time in any one slot. Try to be 
right as often as possible. When you think 
that you have solved the problem stop and 
tell me what you think the answer is. 
(Face down was used in the condition where 
no past trials were exposed.) 


The instructions for the NF condition were 
the same as above except that Ss were told 
only when their responses were wrong and 
they were to lay the cards face up in the back 
row when wrong and face down in the front 
row when right; both front and back rows 
were exposed for PNF condition. The task 
ended when 64 trials had elapsed or when the 
S could correctly verbalize the solution; for 
example, crosses go in A, triangles in B, etc. 
There was no limit on the response-time 
interval, 
RESULTS 


The analysis of variance on the 
number of errors as a function of type 
and number of instance 


s available was 
performed. Both the main effects of 


type, F (2,75) = 5.79, p < .01, and 
number of available stimuli, F (4, 75) 
= 2.85, p < .05, were significant, but 


there was no Type X Availability 
interaction, F (8, 78) = 81, 5 > 05. 
Subsequent analyses with Duncan’s 
test reveal, as does Fig. 1, that the 
greatest improvement in performance 
is from n-0 to n-1 level. In all com- 


n-4 (df = 85, p < :05). There were 
no significant differences between any 
other two levels of availability, Or. 
thogonal polynomial analysis for the 
four components of trend Was per- 
formed on the availability effect with 


NUMBER AND TYPE OF AVAILABLE INSTANCES 7 


MEAN ERRORS 


2 3 4 


AVAILABILITY LEVELS 


Fic. 1. 
of stimulus instances. 


linear and quadratic components 
showing significance, as suggested by 
Fig. 1 (Fin 5.67, Faua = 4.61, 
p < .05), which supports the conclu- 
sion that availability beyond z-1 is 
not facilitative. 

The errors were most numerous in 
the NF group. Cross comparisons 
with Duncan's test between the three 
types of available instances showed 
that only the differences between NF 
and PF and between NF and PNF 
conditions were significant (df = 87, 
p < 05). Figure 1 shows that in the 
n-0 availability condition, where no 
previous information was available, 
type of instances was not an important 
factor; but, when information from 
past trials was provided, Ss who were 
permitted to view only their wrong 
responses (NF) perform consistently 
worse across all availability levels 
than Ss in PF and PNF groups. 
Thus, the overall significance of types 
of instances and availability levels is 
attributed to slower learning where 
(a) only incorrect instances were €x- 


Mean errors as a function of availability levels and three types 
(Each point represents N of 6.) 


posed and (b) where none of the 
previous stimuli were available. 


DISCUSSION 


The fact that PNF and PF groups 
with 2-1 or more availability performed 
equally well and were superior to NF 
demonstrates that the type of stimulus 
instances available, rather than E's feed- 
back per se, facilitated solution. This 
conclusion emerges from the finding that, 
in the 7-0 availability condition, Ss did 
equally well, regardless of the type of 
feedback (Fig. 1). 

The finding that availability of past 
information had a facilitative effect in 
concept learning was consistent with 
results of Hovland and Weiss (1953), 
Cahill and Hovland (1960), and Bourne, 
Goldstein, and Link (1964). Specifically, 
present evidence indicated that the 
greatest improvement in performance 
was from 2-0 to Availability Level n-1 
(up to four previous stimuli exposed). 
The relative simplicity of the problem 
used in the present study might serve to 
explain why Ss failed to benefit from 
past available information beyond 1-1. 
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Bourne et al. (1964) found that availa- 
bility of previous stimuli was less facilita- 
tive in problems of lesser. complexity. 
And since, in the present study an un- 
limited response interval was used, it 
may be ruled out that Ss did not have 
sufficient time to assimilate information 
exposed at the higher availability levels. 
With availability levels over 2-1, PNF 
groups and PF groups were clearly 
superior in overall performance to NF 
groups. The result is likely due to Ss 
having in view some correctly sorted 
instances, i.e., instances of what concept 
is, rather than incorrectly sorted in- 
stances which provide information what 
the concept is not. Formally, correctly 
sorted stimuli are positive instances and 
incorrectly sorted ones negative. [t is 
well known that concept learning pro- 
ceeds more rapidly on the basis of 
positive as compared to negative in. 
stances (Hovland & Weiss, 1953). That 
PF and PNF groups do equally well 
further testifies to the fact the Ss in the 
PNF group gained very little information 
from having Past errors available. 
| Facilitative effects of availability of 
memory" information from past tri 
lend no support for the "no me 
assumption in concept-le. 
Present results stress a 


als 
mory” 
arning models. 
t least limited 


utilization of previous instances, as was 
recently suggested by Trabasso and 
Bower (1964), who found that Ss 
remembered information in their working 
hypothesis, since on solved problems, the 
recall of the relevant dimension was 
significant. 
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RELATION BETWEEN STIMULUS INTENSITY AND 
OPERANT RESPONSE RATE AS A FUNCTION OF 


DISCRIMINATION TRAINING AND DRIVE! 


EFFREY A. GRAY? 
J 


Institute of Psychiatry, University of London 


Noise intensity was varied while rats performed a bar-pressing response 
for food on a VI schedule. For 4 Ss (the Discrimination group) the 
noise was present only during periods of reinforcement; for 4 others 
(the Nondiscrimination group) noise was uncorrelated with reinforce- 
ment contingencies. Results indicate that response rate was a positive 
function of SP intensity in the Discrimination group and essentially 
unrelated to noise intensity in the Nondiscrimination group. In the 
Discrimination group response rate during periods of S^ wasa negative 
function of preceding SP intensity. In a second experiment the effects 
of drive on these relationships were investigated. Although there were 
no significant interactions between drive and SP intensity, the slopes 
of the curves relating response rate to noise intensity were systematically 


altered by variation in drive. 


Results are discussed in the light of 


Pavlovian theory and Perkins’ (1953) generalization of inhibition 
account of stimulus-intensity dynamism. 


In contrast to the large body of 
evidence gathered by Russian in- 
vestigators of the classical condition- 
ing situation that the intensity of the 
conditioned stimulus is an important 
determinant of the magnitude of the 
conditioned response (Gray, 1964; 
Razran, 1957), Western evidence on 
the same point remains scarce and, 
until recently, largely negative. A 
number of workers (Carter, 1941; 
Grant & Schneider, 1948, 1949; Kim- 
mel, 1959; Kimmel, Hill, & Morrow, 
1962; Passey & Herman, 1955; Wil- 
cott, 1953) have reported failures to 
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thanks are due to P. L. Broadhurst for his 
advice and supervision, to H. J. Eysenck for 
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reproduce the phenomenon described 
in Pavlovian terms as “the law of 
strength," ie. that, up to a limiting 
value of CS intensity, this parameter 
and response magnitude are positively 
related. Other workers have pub- 
lished results which are in agreement 
with the Russian findings, but have 
not found the law of strength to be a 
very clearly demonstrable phenom- 
enon. Thus, Hovland (1937), a sta- 
tistical analysis of whose results was 
performed by Hilgard and Marquis 
(1940), and Miller and Greene (1954) 
reported findings which were in the 
direction suggested by the Russian 
studies, but which failed to reach an 
acceptable level of statistical signifi- 
cance. Passey and Possenti (1956) 
found a difference, in the expected 
direction, in number and latency of 
alley-running responses for food re- 
ward in groups of rats trained at two 
fairly widely separated levels of 
illumination—4 and 128 ft-c—but 
found no differences between 8, 16, 
32, and 64 ftc. Similarly, Myers 
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(1962), using a wheel-turning shock 
avoidance response with rats, found a 
greater number of responses to tones 
of 80 and 100 db. than to a tone of 60 
db., but no differences either between 
the 80- and 100-db. tones or between 
65- and 85-db. buzzers. Just as 
differences may be found between 
some stimuli, but not between others, 
so a law of strength may be found in 
the case of some individuals, but not 
in the case of others. Thus Zener 
(1949), using a classical salivary 
response with food reinforcement with 
five dogs, found a law of strength 
relating tone intensity to amount of 
salivation in the case of three of them, 
but not in the case of the other two. 
In five other studies (Barnes, 1956; 
Beck, 1963; Berlyne, 1950; Brown, 
1942; Walker, 1960) which did succeed 
in demonstrating the operation of a 
law of strength, only two values of the 
stimulus were used, so that it is not 
possible to draw any conclusions from 
them about the nature of the function 
which relates conditioned stimulus 
intensity to response magnitude, other 
than that it is a positive onc. Thus, 
the only study which, besides demon- 
strating the operation of a law of 
strength, also provides information at 
a number of intensities of the condi- 
tioned stimulus is that of Kessen 
(1953). This worker presented six 
light intensities, ranging from .16 to 
2.26 log ft-c, as the CS for a wheel- 
turning response to avoid shock. He 
found that both number of CRs and 
response speed were positively related 
to CS intensity, and that the curves 
depicting this relationship rose to an 
asymptote at 1.90 log ft-c—the next 
to most intense light presented. 

In view of this paucity of Western 
evidence for what Russian investi- 
gators confidently describe as a “law,” 
and in view of the importance of this 


"Jaw"—and, especially, of the exact 


nature of the function relating CS 
intensity to response magnitude—for 
certain aspects of Pavlovian person- 
ality theory (Gray, 1964), it seemed 
worth tackling the problem anew. 

The great majority of previous 
Western studies of the effect of 
CS intensity on response magnitude 
have employed aversive stimulation 
(Barnes, 1956; Beck, 1963; Kamin & 
Schaub, 1963; Kessen, 1953; Myers, 
1962; Walker, 1960; and others). 
This is an unfortunate choice, for high 
intensities of any kind of stimulation 
may be noxious and therefore elicit 
fear responses; thus, an apparent 
effect of CS intensity on the magni- 
tude of an aversive response may be 
due to the unconditioned fear-induc- 
ing properties of the CS rather than 
to its role as a conditioned signal. For 
this reason, the present investigation 
employs an appetitive response—bar 
pressing for a food reward by the 
hungry rat. White noise, at five in- 
tensities ranging from 70 to 100 db., 
served as SP, 


EXPERIMENT | 


In at least two theoretic 
we would expect that onl 
native stimuli would affect response 
rate. Pavlovian theory (Gray, 1964) 
and the Russian empirical data (Raz- 
ran, 1957) indicate that the law of 
strength only appears when “excita- 
tion is concentrated'’—which means, 
in effect, that it is only a conditioned 
stimulus which is able to affect re- 
Sponse magnitude. The same problem 
has been raised in Western work 
on “stimulus intensity dynamism,” 
Hull's (1949) Synonym for Pavlov’s 
“law of Strength." Perkins (1953) has 
argued that stimulus intensity dyna- 
mism is due to generalization of inhibi- 
tion from an unreinforced Zero-inten- 
sity stimulus, 


al contexts 
Y discrimi- 


and therefore only 
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appears in experimental situations in 
which the reinforcement contingencies 
deliberately or unwittingly set up by 
E determine the formation of {a dis- 
crimination between a positive stim- 
ulus and the absence of that stimulus. 

Perkins’ generalization of inhibition 
hypothesis supposes a failure of dis- 
crimination between a low-intensity 
SP and an S? constituted by the 
absence of the SP. Clearly, such a 
failure of discrimination should work 
both ways: if the negative stimulus is 
presented after the positive one, it 
should elicit a higher rate of response, 
the Jower the intensity of the positive 
stimulus—a kind of law of strength in 
reverse. 

Experiment | is concerned, then, 
with the following problems: (a) Is it 
possible to demonstrate a clear-cut 
law of strength in the situation em- 
ployed? (b) Is this effect limited to a 
discriminative stimulus or does it also 
occur with nonsignal stimuli? (c) Is 
there an effect of SP intensity on 
response rate in a subsequent period 
of S^? 

Method 
Subjects 


The Ss were eight experimentally naive 
adult male albino rats belonging to the 
Maudsley Nonreactive strain (designated 
MNR by Jay, 1963) and between 116 and 160 
days old at the beginning of the experiment. 
'The Discrimination group consisted of four 
Ss (r 27, 37, 18, and 12) and the Nondiscrim- 
ination group of another four (r 9, 10, 11, 


and 20). 


Apparatus 


'The apparatus consisted of: (a) a modified 
Skinner box; (b) a system of relays and a film- 
strip timer which enabled E to program either 
fixed interval (FI) or variable interval (VI) 
reinforcement of the animal's lever-pressing 
responses, together with counters which 
recorded the number of responses and the 
number of reinforcements; and (c) à sound- 
generation system which delivered white 
noise of variable intensity to four 8-in. loud- 


speakers mounted on top of an external box 
in which the Skinner box was placed. The 
Skinner box, 8 X 13 X 8 in., contained a 
single lever and a food tray attached to the 
rear wall. A force of 7-8 gm. was sufficient 
to depress the lever. Reinforcement consisted 
of pellets weighing approximately .05 gm., and 
containing 50% rat cake meal (MRC diet 
41B) and 50% sucrose. The pellets were 
delivered to the food tray by a post office uni- 
selector. The Skinner box and the uniselector 
were placed in an external box, 8 X 16 X 16 
in., made of }-in. thick softwood. The four 
loudspeakers were rigidly mounted on top of 
the external box. No attempt was made to 
soundproof the apparatus, but the two boxes 
in which the experimental animal was placed 
were kept on their own in a relatively quiet 
room (ambient noise level did not usually 
exceed 60 db.). 

Noise intensities were calibrated by means 
of a Dawe sound-level meter, Type 1400D, 
in db. re 0.0002 dynes/cm?. For this purpose, 
the microphone of the sound-level meter was 
placed on the floor of the Skinner box, mid- 
way between the lever and the food tray, 
roughly where the animal's head would 
normally be while it was responding on the 
lever. Readings obtained with the sound- 
level meter were correlated with the readings 
obtained on a voltmeter across the amplifier 
output, both while this was being delivered 
to the loudspeakers and while it was being 
delivered to a dummy load. In this way, by 
using the voltmeter readings, it was possible 
for E, during the course of an experiment, to 
select the required intensity for the next 
period of noise (see Procedure) while no noise 
was being delivered to the Skinner box, and 
also to monitor the intensity of the noise 
actually delivered to the Skinner box. 
Weekly calibration checks showed that the 
error attached to this method of selecting the 
required noise intensity was usually within 
the limits of +1 db. 


Procedure 


"Throughout the experiment, testing was 
conducted from Monday to Friday only; no 
testing occurred at weekends, though de- 
privation schedules were maintained. The 
Ss were put on a 22-hr. deprivation schedule, 
and were fed daily between 3 and 5 P.M. 
Testing was always carried out in the morning, 
usually between 9.30 A.M. and 12 noon for the 
Discrimination group, and between 12 and 1 
p.M. for the Nondiscrimination group. 

Preliminary training.—After Ss’ weights 
had stabilized on the deprivation schedule, 
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they were introduced to the Skinner box for 
habituation and measurement of operant 
level. Each S was placed in the apparatus 
on 5 consecutive days and exposed to the 
same stimulation by noise to which it was to 
be exposed during the main. part of the ex- 
periment (see below) but without reinforce- 
ment. (Owing to a later change in the com- 
position of the groups, Rat r 12 was treated as 
a member of the Nondiscrimination group 
during habituation.) The number of times S 
pressed the lever during each period of stimu- 
lation by noise or by silence was recorded. 
There followed 3 wk. of training on the 
bar-pressing response for food reinforcement. 
By the end of the first of these 3 wk. all Ss 
were working successfully on FI 7.5 sec.; for 
the remaining 2 wk. they were on VI 30 sec., 
each daily session lasting 12.5 min. During 
this part of the experiment no noise was 
presented to Ss. Response and reinforcement 
scores were recorded at the end of every 
2.5 min. 
Main experiment.—After completion of the 
4 wk. preliminary training, the two groups 
were placed on different schedules, For the 
Discrimination group periods of noise alter- 
nated regularly with periods of silence, all 
periods lasting 2 min., 45 sec.; during periods 
of noise (SP), reinforcement was available on 
VI 30 sec.; during silence (S^), no reinforce- 
ment was delivered. A daily session lasted 
27.5 min. and consisted of five periods of VI 
and five of extinction. Sessions always began 
with VI and ended with extinction. The 
Nondiscrimination group continued on VI 30 
sec., with no interspersed extinction com- 
ponents. A session for this group consisted 
of five periods, each lasting 2 min., 45 scc. and 
each accompanied by noise; thus the session 
length was 13 min., 45 sec. For both groups 
five noise intensities were used: 70, 80, 85, 90, 
and 100 db. In a single daily session, each 
of these intensities was presented once in a 
random order determined by a 5 x 5 Latin 
square, of which days of the week formed the 
columns; periods within a session, the rows; 
and noise intensities, the letters. To ensure 
the maximum randomization, a separate 
Latin square was selected at random for each 
S each week. For the Discrimination group, 
E was able to adjust the volume control dur- 
ing a period of silence; in order to do the same 
for the Nondiscrimination group, it was neces- 
sary to turn off the noise 7.5 sec. before the 
end of each noise period and adjust the 
volume control in this interval of silence. 
Any reinforcements programed during this 
interval were delivered. Scores for the 
relevant noise period were taken when the 


subsequent intensity was turned on; a period 
of noise for the Nondiscrimination group thus, 
in fact, consisted of 2 min., 37.5 sec. noisc 
followed by 7.5 sec. silence. 

During each period of VI, five reinforce- 
ments were programed; maximum possible 
reinforcement during a single session was thus 
25 pellets, and it was rare for any S not to 
obtain all of these. All Ss were tested for at 
least 4 wk. in the ways described; two mem- 
bers of the Discrimination group (r 27 and 37) 
were tested for a fifth week as well. 

Three Ss from the Nondiscrimination 
group (r 9, 10, and 11) were tested for a 
further 2 wk. with a changed procedure. 
Session length was doubled, along with 
maximum possible reinforcements, and periods 
of noise were alternated with periods of 
silence, all periods lasting 2 min., 45 sec. and 
reinforcement being available throughout. 
Noise intensity was randomly allocated to 
period as before. 

Throughout all the various treatments, E 
recorded the number of responses and of 
reinforcements at the end of each of the 
periods into which sessions were divided. 


Results 


Operant Level and Discrimination 
Training 

Analysis of variance of the session 
totals for noise and silence obtained 
from r 27, 37, and 18 during the week 
of habituation showed that the pres- 
ence of noise in the experimental space 
had a dynamogenic effect on bar- 
pressing behavior, ; (1, 8) = 8.36, 
p «.05: the mean number of re- 
Sponses per session per S in noise was 
7.6, in silence 3.9, Analysis of vari- 
ance of the scores obtained during 
periods of noise by all eight Ss showed 
that noise intensity had no effect on 
Operant level in either group. No 
Source of variation was significantly 
different from chance in this analysis 
except Animals. A ¢ test showed that 
r 20 had a significantly higher score 
than any other 5; the other Ss did not 
differ from one another, 


There was no difference between 
the rates of response achieved by the 


STIMULUS INTENSITY 


TABL 


RESPONSE RATES PER MINU 


AND OPERANT RATE 


IE IN NOISE AND SILENCE 


Exp. II 


: | . % Increas 
wnt | NE EI 
Noise Silence | Noise Silence | Silence | Noise Silence 
r27 | 37.61 10.82 35.80 543 | 12:21 36 138 
r37 | 20.31 5.66 17.06 2.65 | 4.39 33 66 
r18 | 25.22 13.13 54.24 18.63 24.41 9 31 
r12 | 23.98 9.34 29.61 6.60 | 13.49 24 104 


two groups by the end of the training 
period, nor were rates affected dif- 
ferently in the two groups by the 
transition to the final phase of the 
experiment. 

In the Discrimination group, all 
four Ss learnt the discrimination be- 
tween noise and silence. In each case, 
response rate during noise was higher 
than response rate during silence at 
levels of significance well beyond 
p = .001 (Table 1). Furthermore, in 
the case of three of the four Ss, the 
discrimination was clearly established 
by the end of the first week: although 
the Weeks X Noise term in the anal- 
yses of variance was significant in the 
case of r 37, r 18, and r 12, ¢ tests 
showed that rate in noise was higher 
than rate in silence for all weeks of 
testing except in the case of r 18; r 18 
did not show a significant difference 
between rate in noise and rate in 
silence until Week 2. The curves for 
response rate in noise and in silence 
ach S, in Fig. 1. If it 
is clear that all Ss successfully dis- 
criminate noise from silence, the 
question still arises whether they 
succeeded in discriminating the lowest 
intensity noise—70 db.—from silence. 
To answer this question, analysis of 
variance was used to compare the 
scores obtained during the 70-db. 
periods with the mean daily silence- 
period scores (i.e., total session Scores 


are shown, for e 


during silence divided by five, since 
there were five periods of silence per 
session). The results of the analyses 
showed that the rate of responding 
during 70-db. noise was significantly 
higher, at beyond the 0.1% level, than 
the rate during silence for each S. In 
the case of r 37 and r 18, there was a 
significant interaction between Weeks 
and Noise (ie. 70 db. noise); ! tests 
showed that rate during 70 db. was 
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Fic. 1. Response rates in noise (SP) and 
silence (S3) for each member of the Dis- 
crimination group in Exp. I. 
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higher than rate during silence for all 
4 wk. in the case of r 37, but not in 
the case of r 18, which did not succeed 
in discriminating 70 db. from silence 
until the second week of testing. 


Effects of Stimulus Intensity on Con- 
current Reinforced Responding 


Figure 2 shows response rate as a 
function of noise intensity in the two 
groups; Table 2 presents, for each S 
separately and for the two groups, 
those results of the analyses of 
variance of the scores recorded during 
noise periods which concern the 
Intensity variable. As well as the 
significance of the main Intensity 
effects, Table 2 also presents the 
results of tests of the significance of 
linear regression of response rate on 
noise intensity and, where appropri- 
ate, regression equations. It will be 
seen that, in the Discrimination group, 
response rate is in all cases a signifi- 
cant positive function of SP intensity ; 
linear regression is significant in the 
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case of each S; r 27 and the group as à 
whole show a curvilinear component 
as well In the Nondiscrimination 
group, on the other hand, response 
rate remains unaffected by noise 
intensity in two Ss, and in the group 
considered as a whole. There is, 
however, a weakly significant effect of 
noise intensity in the case of two Ss 
(r 9 and 10), but in neither case does 
this effect resemble the effect of noise 
intensity in the Discrimination group: 
r 9 tends to respond more the lower 
the intensity to which it is exposed, 
while r 10 presents a curvilinear rela- 
tionship between intensity and re- 
Sponse rate, rising to a peak at 85 db. 
and then falling again, 

The effects of noise intensity in the 
Discrimination group remained the 
same throughout the experiment, as 
indicated by nonsignificant interac- 
tions between Weeks and Intensities. 
In the case of two members of the 
Nondiscrimination group, however, 
there were significant interactions of 
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FIG. 2. Response rate during noise as a function of noise intensity in Exp, I: left-hand graph, 
the Discrimination group; right-hand graph, the Nondiscrimination group. 
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TABLE 2 


EFFECTS or NOISE INTENSITY ON RESPONSE RATE DURING PRESENTATION OF NOISE: 
SIGNIFICANCE LEVELS AND LINEAR REGRESSION EQUATIONS IN EXP. I 


s Main Effect Linear Regression uoc o TOERE PEN A AN 
Discrimination Group 
r27 16.10*** 56.02*** 3.43* y = 20.8 + 0.972% 
r 37 11.58*** 44.05*** 0.76 y = 18.8 + 0.437x 
rig 321* 5.69* 2.38 y = 40.5 + 0.330x 
r 12 3.65* 11.42** 1.06 y = 45.3 + 0.243x 
Group 23.04*** 81.48*** 3.56* y = 29.9 + 0.519x 
Nondiscrimination Group 
r9 3.14* 8.41** 1.39 y = 132.3 — 0.467x 
r 10 3.40* 1.39 4.07* — 
rll 1.48 = — TM 
r 20 0.79 — — TS 
Group 0.66 | — — = 


Note.—df = 4 for the main effect, 1 for linear regression, and 3 for departures from linearity; the error variances 


have df = 48 in the case of the individual Ss (df = 60 for r 27 and r 37) and df 
In the regression equations, y = mean number of responses in a single period of noise lasting 2 min., 


analyses. 
45 sec. and x = noise intensity in db. 
*p «.05. 
** 5 <.01. 
*** p c 001. 


this kind: for r 10, F (12, 48) — 2.78, 
p < .01;forr 11, F = 3.60, p < .001. 
It was difficult to discern any sys- 
tematic trend in the data producing 
these interactions; there was, however, 
a suggestion of a predominantly 
negative relationship between stimulus 
intensity and response rate in the first 
week or two which then disappeared. 
There was only one other significant 
interaction involving Intensity in any 
of the analyses. This was an inter- 
action between Animals and Intensity 
in the analysis of the Discrimination 
group, F (12, 192) = 4.22, p .001. 
When 4 tests were carried out, it 
appeared that the most important 
differences going to make up this 
interaction. were the much steeper 
rise between 80 and 85 db. shown by 
r 27 than by any other S and the 
significant (p « .05) fall between 90 
and 100 db. shown by r 18 (Fig. 2). 
Analysis of variance of the daily 


= 192 in the case of the group 


session totals for noise and silence 
recorded for the three Ss (r 9, 10, and 
11) from the Nondiscrimination group 
exposed for an extra 2 wk. of testing 
to alternating noise and silence with 
reinforcement available throughout 
(see Procedure) demonstrated that, 
in each case, rate during noise was 
significantly higher than rate during 
silence (Table 3). When the scores 
obtained from these Ss during the 
periods of noise were submitted to 
analysis of variance, it was found that 
noise intensity did not affect response 
rate in any S, nor were there any 
significant interactions between In- 
tensity and Week of Testing. When 
the results of the three Ss were 
analyzed together, however, a signifi- 
cant effect of Intensity appeared, 
F (4, 12) = 4.16, p < .01; there were 
no significant interactions. between 
Intensity and either Weeks or Ani- 
mals, Figure 3 shows both the 
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TABLE 3 
DIFFERENCES BETWE RESPONSE RATES 
DURING NOISE AND DURING SILENCE IN 
THREE Ss FROM THE NONDISCRIMINATION 
Group EXPOSED TO ALTERNATING 
NOISE AND SILENCE 


7 


; | Rate per Minute | Rate per Minute 
Animal 


| during Silence | during Noise 
| 7 * 
9 32.14 39.79 20.80 
10 44.21 47.10 17.48* 
r it 31.53 35.46 64.66** 
Note.—d f for F values = 1/4, 
*p «.05. 
**) «.0l. 


significant Intensity effect for 
group and the nonsig 
for each S. The curvi 
Intensity effect for the 
from the figure, was confirmed by 
regression analysis, which showed that 
only the component of variation due 
to departures from linearity was 
significant, F (3, 72) = 5.52, p< 1. 


the 
nificant effects 
linearity of the 
group, obvious 
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Fic. 3. Response rate during noise as a 
function of noise intensity in three members 
of the Nondiscrimination group exposed to 
alternating noise and silence with reinforce- 
ment available in both, 
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TABLE 4 


EFFECTS or NOISE INTENSITY ON RESPONSE RATE DURING SUBSEQUENT PERIODS OF SILENCE: 
SIGNIFICANCE LEVELS AND LINEAR REGRESSION EQUATIONS IN Exp. I 


$ Main Efect Linear Regression Pine REGS Equation 
Discrimination Group 
r 27 2.46 6.50* 1.11 
r 37 0.56 0.52 0.57 
r 18 9.78*** 45.7309 1.13 
r 12 2.91* 11.14** 0.17 
Group 10.92*** 33:56" 3.36* 
Nondiscrimination Group 
r9 0.80 0.03 1.05 — 
r 10 0.64 2.33 0.07 — 
eT 1.41 4.56* 0.36 y = 114.1 — 0.323x 
Group 1.20 2.40 0.80 i — 


Note.—df = 4 for the main effect, 1 for linear regression, 


ances have df = 60 in the case of r 27 and r 37, 48 for r 


crimination group, and 72 for the Nondiscrimination group. 


and 3 for departures from linearity; the error vari- 
18 and r 12, 24 for r 9, r 10, and r 11, 192 for the Dis- 
In the regression equations, y = mean number of 


responses in a single period of silence lasting 2 min., 45 sec. and x = preceding noise intensity in db. 
< 


p «.05. 
wep EDL 
** p < 001. 


silence for the Discrimination group 
and for the Nondiscrimination group 
during exposure to alternating noise 
and silence. Table 4 shows the 
significance levels of the Intensity 
effects illustrated by the curves in 
Fig. 4. It will be seen that, in the 
Discrimination group, every S except 
r 37 shows a significant linear negative 
relationship between SP Intensity and 
subsequent Sê response rate. The 
same pattern also appeared in the 
analysis of the Discrimination group 
as a whole, although, as well as the 
linear component of variation, there 
were significant departures from lin- 
earity, apparently due to a steeper 
decline in response rate between 85 
and 100 db. than between 70 and 85 
db. The group analysis also revealed a 
significant Animals X Intensity inter- 
action, F (12, 192) = 3.04, p < .001; 
this seems to be due to the particu- 
larly marked effect of SP intensity on 


S? response rate displayed by r 18 and 
the lack of any effect in r 37. The 
curves for the Nondiscrimination 
group are not dissimilar to those for 
the Discrimination group; however, 
only in the case of r 11 was the effect 
of noise intensity weakly significant 
when linear regression was taken into 
account. Since there were no signifi- 
cant effects of noise intensity, either 
alone or in interaction, in the group 
analysis, the most likely conclusion is 
that this variable, when it has no 
signal properties, has no real effect 
on response rate in silence. 


EXPERIMENT lI 


Pavlovian theory (Gray, 1964) 
predicts that the function relating 
stimulus intensity to conditioned re- 
sponse magnitude is itself a function 
of certain other variables. Among the 
most important of these are drive 
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level and individual differences along 
a dimension of personality termed the 
“strength of the nervous system. 
The positive relationship between 
stimulus intensity and response mag- 
nitude is supposed to hold only up 
to a limiting value of stimulus in- 
. tensity; beyond this point, further 
increases in stimulus intensity are 
reported to cause a decrement (or, at 
least, to cease causing an increase) in 
response magnitude. This limiting 
value of stimulus intensity is known 
as the “threshold of transmarginal 
inhibition." Pavlovian theory holds 
that this threshold will be lower, the 
higher the drive level, Experiment II 
reports à test of this hypothesis: 
curves relating SP intensity to operant 
response rate are obtained from the 
same Ss under two levels of drive 
which differ both from one another 


and from the drive level obtaining 
in Exp. I. 


Method. 
Subjects 


The Ss were the four members of the Dis- 
crimination group and r 20 from the Non- 
discrimination group used in Exp. I. All Ss 
were placed on the new feeding schedule 
described in the procedure below. In the 
case of r 27 and 37, 1 wk. with no testing of 
any kind, but with the new feeding schedule 
in operation, intervened between the two 
experiments; for the other three Ss there was 
only the usual weekend break, during which 
the new schedule was commenced. 


Procedure 


The apparatus ha: 
I. The reinforcemen 
as before. 


All Ss were placed on a new 
schedule. On this Schedule the 
for 1 hr. a day only, either between 9 and 10 
A.M. or between 12 and 1 Py. On days on 
which S was fed between 9 and 10 AM., it was 
tested between 12 and | P.M.; these consti. 
tuted the "low-drive" days. On days on 
which .S was fed between 12 and 


1PM,, it Was 
tested between 9.30 and 10.30 A.M. ; these con- 


s been described in Exp. 
t contingencies remained 


deprivation 
Y were fed 


stituted the "high-drive" days. Thus, if we 
adopt, with some slight changes, Eisman's 
(1956) symbolism, in which k = hours since 
last feeding and h; = hours per 24 hr. during 
which the animal is allowed ad-lib. access to 
food, then: 

In Exp. I, k = 16.5 — 19.5, hy = 2 


In the high-drive condition of Exp. TI, 
h = app. 24, hy 


Hl 


In the low-drive condition of Exp. II, 
h = app. 2, hi = 1. 


For each .S, low- and high-drive days alter- 
nated regularly ; the first day of the first week 
of testing was a low-drive one for r 27 and r 37 
and a high-drive one for r 18, r 12, and r 20. 
Apart from the change in drive conditions, 
the procedure was the same as in Exp. I. 
Random presentation of noise intensities 
during the periods into which daily sessions 
were divided was again achieved by preparing 
a 5 X 5 Latin square for cach animal. Each 
Latin square was now, however, repeated for 
2 consecutive wk. In this way, each column 
of the Latin square was presented both on a 
low-drive day and on a high-drive day. For 
example, if H — high drive and L = low 
drive, and the numbers 1-5 stand both for the 
five columns of the Latin square and the 5 
days of the week on which testing was con- 


ducted, then a typical 2-wk. testing sequence 
would read: 


ILHLHLHLHLH LHL 

12345 12345 
Over the weekend, although no testing oc- 
curred, the alternation of feeding times con- 
tinued. The experiment was continued for 
4 wk., allowing one replication of the testing 
sequence described above; a different Latin 
Square was used for each replication. 


Results 
Effect of Drive Level on Response Rates 


The method adopted for varying 
drive level was successful in the case 
ofallSs. In each case, response rates 
in both noise and silence were higher 
on high-drive days than on low-drive 
days at levels of significance beyond 
P — .001 (by analysis of variance). 
Mean response rates under the two 
conditions are shown for the Dis- 


crimination group in Table 1. As 


STIMULUS INTENSITY AND OPERANT RATE 19 


might be expected on the assumption 
that time since last feeding (4) is the 
main determinant of drive level, the 
response rates recorded in Exp. | 
fall between the values recorded in the 
two drive conditions of Exp. II in the 
case of r 27 and 37 for SP responding 
and r 27 and 12 for S? responding. 
Rat r 20 also displayed this ordering 
of response rates (35.93 responses/min 
in the low-drive condition, 41.18 in 
Exp. l, and 48.48 in the high-drive 
condition). However, both the low- 
and the high-drive conditions gener- 
ated higher response rates than those 
recorded in Exp. I during both SP 
and S? in the case of r 18 and during 
SP only in the case of r 12. This 
overall increase in response rate could 
be due to continued experience of the 
experimental situation, to the shorter 
time allowed for adaptation to the 
new feeding schedule, or to a par- 
ticularly marked effect in these Ss 
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Fig. 5. Response rate as 


left-hand graph, SP response rat 


of the reduction in number of hours 
feeding per 24 hr. (//; Eisman, 1956). 
In the case of r 37, due, no doubt, to 
continued nonreinforcement, S^ re- 
sponse rate was lower in both drive 
conditions than in Exp. Il. All these 
differences between response rates in 
the two experiments were fully signifi- 
cant, as determined by analysis of 
variance. 


Effects of Noise Intensity and Drive in 
the Discrimination Group 

Figure 5 presents the curves relating 
both SP and S? response rates to SP 
intensity for each S in the Discrimi- 
nation group. Table 5 shows the 
results of the relevant significance 
tests from the analyses of variance 
together with the linear regression 
equations. It can be seen that the 
results of this experiment confirm both 
the positive effect of SP intensity on 
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TABLE 5 


EFFECTS OF SP [NTENSITY ON SP AND SUBSEQUENT S? RESPONSE RATES IN ENP. II: 
SIGNIFICANCE LEVELS AND LINEAR REGRESSION EQUATIONS 


» - Departures from " s 2 
S Main Effect Linear Regression Linearity Regression Equation 
E F F 
i SP Response Rates 
4.75** 15:62*** 1.13 y — 70.1 + 0.545x 
r$ 4.65** 18.11*** 0.17 y= 30.5 + 0.284x 
r 18 2.67 2.65 2.68 — 
r12 Tote 19,39 *e* 3.35* y = 63.6 + 0.327x 
Group 9,85» 31,601*** 1.26 y = 74.5 + 0.353x 
s^ Response Rates 
r27 2.34 | 6.39* 0.98 | y= 46.7 — 0.269x 
r 37 1.41 4.25* 0.46 y = 18.0 — 0.098x 
r 18 11.19% 40.61*** 0.81 y = 192.9 — 1.561x 
r12 4.40** 15.16*** 2.80 y= 61.8 — 0.402x 
Group 23.62*** | 03,25*88 0.41 y= 80.6 — 0.594x 
Note.—d f Ur the main cfiect, 1 for linear regression, and 3 for departures from linearity; the error vari- 
int 


ances haved f = he case of the indi 


* 
**5 «01. 
Dp c 001. 


SP response rate and the negative 
effect of SP intensity on subsequent 
S^ response rate observed in Exp. 1, 
In general, the data suggest that both 
relationships are linear over the range 
of SP intensities studied. However, 
linear regression of SP response rate 
on SP intensity is no longer signifi- 
cantly different from chance in the 
case of r 18. In contrast, linear re- 
zression of S^ response rate on preced- 
ing SP intensity is now significant in 
the case of r 37, although it was not 
significant in Exp, I, 

An effect of drive on the law of 
strength (predicted by Pavlovian 
heory) would be expected to appear 
n the present experiments in the form 
f an interaction between Drive and 
ntensity in the analyses of SP re- 
ponses. However, this interaction 
ras not significant in any of these 
nalyses, whether of the individual Ss 
r of the group as a whole. Indeed, 


l vidual Ssand df = 192 in the case of the group analy 
equations, y F mean number of responses in a single period lasting 2 min., 45 sec. and x 
5 


, Inthe regression 
oise intensity in db. 


only one interaction of any kind in- 
volving Intensity was significant, 
This was an interaction between 
Intensity and Replication (Le. first 
2 wk. of testing vs. second 2 wk.) in 
the case of r 18, F (4,48) = 4.53, 
P «.01. Further analysis of this 
interaction by ¢ tests showed that, in 
the first fortnight, rate at 100 db. was 


TABLE 6 


COEFFICIENTS OF LINEAR REGRESSION or SD 
RESPONSE RATE ON SD INTENSITY AND 
or S? RESPONSE RATE ON PRECEDING 

SP INTENSITY AS A FUNCTION 
or DRIVE IN Exp, II 


S Response Rate sà Response Rate 


S |- 


Low Drive | High Drive| Low Drive High Drive 


r27| +0.609 | +0.481 | —0.237 — 0.301 
r37| +0.339 | +0.229 | —0.022 —0.174 
r18| +0.292 | +0.226 | —1.603 | — 1.613 
r12| 4-0.403 —0.306 | —0.498 


+0.250 
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significantly lower than at either 85 
or 90 db. ; in the second fortnight, rate 
at 70 db. was significantly lower than 
at any other intensity ; at all intensi- 
ties, rate during the second fortnight 
was higher than rate during the first 
fortnight. 

In spite of the nonsignificant Drive 
X Intensity interactions, the slopes of 
the curves relating response rate to 
SP intensity were calculated sepa- 
rately for each drive condition in an 
eflort to see whether, after all, any 
systematic changes had been induced 
by the change in drive level. The re- 
sults of these calculations are pre- 
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sented in Table 6. It will be seen 
that, in the case of all four Ss, the 
slope of the curve relating SP response 
rate to SP intensity is lower in the 
high-drive condition and the slope 
relating S? response rate to SP in- 
tensity is higher in the high-drive 
condition. Figure 6 presents the 
relevant curves for SP responding, to- 
gether with the corresponding curves 
from Exp. 1. 


Effects of Noise Intensity and Drive in 
Subject r 20 


Although noise intensity did not 
affect r 20's response rate in Exp. I, 
a significant Intensity effect appeared 
in Exp. 11, F (4, 48) = 3.79, p < .01. 
This was accompanied by a significant 
interaction. between Intensity and 
Replication, F (4,48) — 3.00, p « .05. 
Further analysis of this interaction 
by £ tests showed that the effect of 
noise intensity was almost entirely 
due to a very high response rate 
during 100-db. periods in the first 
fortnight only; there were no signifi- 
cant differences between intensities 
in the second fortnight. There was no 
interaction between Drive and In- 


tensity. 
DISCUSSION 


The most important result to emerge 
from both Exp. I and Exp. II is that the 
intensity of a discriminative stimulus has 
a definite and very consistent effect on 
operant response rate. If we consider 
the four Ss as four replications of each 
experiment, then in all eight replications 
a significant effect of SP intensity on 
operant response rate was found, and, in 
all but one case, this effect was of the 
same kind: as stimulus intensity in- 
creased, so did response rate. It is not 
possible to be quite so definite about the 
exact shape of the function relating SP 
intensity and operant response rate, but 
most frequently it has been monotonic 
and linear when stimulus intensity is 


22 


expressed in decibels. This was the 
finding in five of the eight replications. 

Although the effect of SP intensity on 
response rate in the present experiments 
was highly consistent, it should be noted 
that, quantitatively, it is not a very 
marked phenomenon: the data gained in 
these experiments suggest that, on the 
average, a rise of 10 db. in the intensity 
of the SP is sufficient to increase response 
rate by only about 1.5 responses a 
minute. It is doubtful whether the 
customary scrutiny of cumulative rec- 
ords, without the aid of the kind of 
statistical tool which has been used in 
the present study, would have been able 
to establish the reality of this effect. 

A very different picture emerges when 
we consider the effects of nonsignal noise 
intensity. The most frequent finding 
has been that response rate is unaffected 
by the intensity of nonsignal noise. Such 
effects as did appear were sporadic, oc- 
curring in some Ss but not in others, in 
the group as a whole but not in any 
single S, or in some weeks but not in 
others. They were also irregular, show- 
ing little systematic trend on the occa- 
sions when they did occur and bearing 
very little resemblance either to the 
effect of SP intensity or, indeed, to one 
another. Thus, r 9 in Exp. I (Fig. 2) 
presented a negative relationship be- 
tween noise intensity and response rate, 
r 10 in the same experiment (Fig. 2) and 
the group of three Ss exposed to alternat- 
ing noise and silence (Fig. 3) showed a 
curvilinear relationship, and r 20 in Exp. 
II showed a positive relationship entirely 
due to a very high response rate at 100 
db. in one fortnight only. 

It seems safe to conclude, then, that 
the law of strength, or stimulus intensity 
dynamism, operates only in the case of 
discriminative stimuli. It should, how- 
ever, be pointed out that this conclusion 
is not entirely beyond criticism. As well 
as the difference in the signal value of the 
stimulus, the treatments accorded the 
Discrimination and Nondiscrimination 
groups also differed in other ways which 
might conceivably be of importance. In 
particular, there was no interruption of 
VI reinforcement by extinction com- 
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ponents for the Nondiscrimination group, 
and therefore rate of reinforcement was 
twice as great for this group. 

The data allow us to discount other 
differences between the groups. Thus 
the slight difference in drive level in 
Exp. I (since the Nondiscrimination 
group was always tested after the Dis- 
crimination group) cannot account for 
the absence of a law of strength with 
nonsignal noise, in view of the demon- 
stration in Exp. II that much greater 
differences in drive level may alter the 
slope of the stimulus intensity curve but 
do not alter its shape. Similarly, the 
slightly higher response rate recorded by 
the Nondiscrimination group in Exp. I 
(Fig. 2) cannot be of importance since 
the Discrimination group still displayed 
a law of strength with much higher 
response rates in Exp. II. 

The fact that stimulus intensity dy- 
namism appears only with signal stimuli 
is in agreement with Perkins' (1953) 
hypothesis accounting for this phenom- 
enon in terms of generalization of 
inhibition from the unreinforced zero- 
intensity stimulus. However, the im- 
portance of the signal function of the 
stimulus could be due to other causes 
than those postulated by Perkins, Pav- 
lovian theory also suggests that only 
signal stimuli are concerned in the law of 
strength. A further possibility stems 
from recent work on the neurophysio- 
logical processes involved in habituation 
(Hernández-Peón, 1961; Sharpless & 
Jasper, 1956): it could be suggested that 
the intensity of a nonsignal stimulus 
fails to affect the magnitude or rate of 
conditioned responding because it is 
either blocked before it reaches the CNS, 
or because it does not get sufficient 
"boost" from the reticular arousal sys- 
tem to affect cortical functioning. More 
convincing support for Perkins’ hy- 
pothesis comes from the effect of SD 
intensity on subsequent S4 response rate 
observed in the present experiments, 
The most likely explanation for this effect 
(also probably confined to signal stimuli) 
is that it is due to generalization of 
excitation from low intensities of the 
positive stimulus to the zero-intensity 
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negative stimulus. This explanation, 
together with Perkins’ account of stim- 
ulus intensity dynamism, leads us to 
expect that the positive effect of SP 
intensity on SP response rate and the 
negative effect of SP intensity on S* 
response rate will occur together (as they 
did in these experiments), since both 
arise from the same failure to discrimi- 
nate between positive and negative 
stimuli. 

The failure of any of the interactions 
between Drive and Intensity (Exp. II) 
statistical significance is 
disappointing from the standpoint of 
Pavlovian theory (Gray, 1964). This 
predicts that an increase in drive should 
produce either a curvilinearity in the 
curve relating response strength to 
stimulus intensity or an asymptote at 
higher stimulus intensities. However, 
the experiments did produce evidence 
that something in the nature of the latter 
process was occurring, since all four Ss 
displayed a lower slope of the stimulus 
intensity/response rate curve in the high- 
drive condition. Other evidence in favor 
of Pavlovian theory comes from a con- 
sideration of the results achieved with 
r 18. This S displayed a much greater 
rise in response rate from Exp. I to Exp. 
II than from the low-drive to the high- 
drive condition of Exp. II (Table 1). 
Assuming that this represents an increase 
in drive (due, no doubt to the reduction 
in h,), it is in agreement with Pavlovian 
theory that linear regression of response 
rate on SP intensity was no longer sig- 
nificant under the higher drive level of 
Exp. II in this S. 

A consideration of the individual 
differences observed in these experiments 
also offers some support for Pavlovian 
theory. According to this theory (Gray, 
1964), the absence of a law of strength, 
or the presence of curvilinearities 1n the 
law of strength curve such that highest 
response strength is reached „at some 
point below maximum stimulus intensity, 
are signs of “weakness of the nervous 
system." Such signs were shown in both 
experiments by © 18, suggesting that 
they are a stable feature of this individ- 
ual's responding. Some indication of 


to achieve 


similar tendencies appears in the high- 
drive curve obtained from r 27 in Exp. II 
(Fig. 6). Thus, of the four Ss studied 
r 18 could be judged the “weakest” and 
r 27 the next weakest. Now, there are 
grounds for believing that "weak" ani- 
mals are individuals which are particu- 
larly easily stimulated to the point of 
giving their maximum possible response 
(Gray, 1964). It is therefore consistent 
with the predictions of Pavlovian per- 
sonality theory that these two Ss had 
much higher response rates than r 37 and 
r 12 (means for Exp. I and II combined : 
r 18—41.0 responses/min; r 27—39.9; 
r 12—28.6; r 37—20.1). Considering the 
law of strength as a joint function of 
drive and individual differences (as 
Pavlovian theory demands), it can be 
seen from Fig. 6 that the irregularities in 
this effect become much more marked as 
response rate increases and that the 
nature of these irregularities is, generally 
speaking, of the expected kind. 
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The effect of nonrewarded trials which preceded any rewarded trials 


was studied with rats in a straight runway. 
measured for groups given either 


Running speeds were 


24 nonrewarded trials preceding 2+ 


rewarded (24N-24R), 24 nonrewarded trials interspersed among 24 
rewarded (50%), 24 rewarded trials (24R), or 48 rewarded trials (48R). 
At the end of acquisition, the mean speeds were ordered as: 50% > 48R 
> 24R > 24N-24R, with Group 24N-24R showing marked forgetting 


on the first trial of each day after reward was introduced. 


During 


extinction the ordering was: 50% > 48R > 24R > 24N-24R on the 


first trial of each extinction day, 


but 50% > 24N-24R > 48R = 24R 


on the last trial of each extinction day. 


The present experiment investi- 
gated the effect of nonrewarded trials 
prior to the introduction of reward on 
running speed during subsequent ac- 
quisition and extinction. In addition, 
the effects of these initial nonrewarded 
trials were compared with the effects 
of an equivalent number of ,non- 
rewarded trials which were inter- 
spersed among rewarded trials. 


METHOD 


General design.—The experiment included 
four groups, each of which was given an 
acquisition and an extinction session in a 
straight-alley runway. Of basic interest was 
the comparison made between the extinction 
performance of a group given 24 nonrewarded 
trials in the runway followed by 24 rewarded 
trials (Group 24N-24R), and a group given 
48 trials, all of which were rewarded (Group 
48R). In order to establish the effects of the 
nonrewarded trials per se, a third group was 
also included which received only 24 rewarded 
trials (24R). The fourth group vn, 4 
standard partia 1 group W oe 
received 24 nonrewarded trials intersperse 
among 24 rewarded trials | 
Following acquisition training, 
1 This research was supported by Grant 
G-8706 from the National Science Foundation. 


extinction were given per day for 4 successive 
days. 

Subjects and apparatus.—The Ss were 60 
(15 per group) naive female albino rats of the 
Sprague-Dawley strain, 90 days old at the 
start of runway training. The apparatus was 
the 4-ft. enclosed runway described by Lewis 
(1956). In this runway, running time is 
measured from the opening of the start-box 
door until S enters the goal box. 

Procedure.—Prehandling and feeding sched- 
ule were similar to that employed by Hill and 
Spear (1962), except Ss were (a) prehandled 
without access to food pellets and (b) never 
presented with food sooner than 15 min. after 
S had been handled by E. These modifica- 
tions were introduced to separate the eating 
and handling experience prior to runway 
training. 

Eight Ss, two from each of the four groups, 
were run in rotation with a minimum interval 
of 3 min. between successive trials (eight per 
day for each S). During the first 3 days of 
training, 5s in Group 24R were handled near 
the runway for a period of time equivalent to 
that experienced by the other 5s. They were 
not, however, run in the runway until the 
rewarded trials of Group 24N-24R began 
on Day 4. 

The schedule of reinforcement was the 
same for all Ss in Group 50% and was 
modified from that employed by Wagner 
(1961). The daily sequences were as follows: 
+-+--++-, —HdE——- T 
are, Ee cse: 
+--+; p a a A 
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During all trials, the start-box door was 
opened as soon as .S was oriented toward it. 
Following a run, S was confined in the goal 
box for 10 sec. On rewarded trials, two .045- 
gm. Noyes pellets were present in a metal food 
cup in the goal box. T he food cup was 
removed on nonrewarded trials. 


RESULTS AND DISCUSSION 


The course of acquisition is shown 
in Fig. 1. Group 24R which received 
only 24 trials, all rewarded, is shown 
on the graph from Trials 25 to 48. 

Three facts are of particular interest 
with respect to these acquisition 
speeds. (a) Group 24N-24R, which 
received 24 nonrewarded trials preced- 
ing its 24 rewarded trials, demon- 
strated marked forgetting on the 
initial trials of the second and third 
days of acquisition, and at no time 
gave any indication of latent learning. 

(b) When the average speeds of these 
two groups were compared by Dun- 
can's multiple-range test (Edwards, 


RUNNING SPEED (FT/SEC) 
m 


Fic. 1. Mean running speed for each acquisition trial. 


1960) on the last day of acquisition, 
Group 24K was found to have run 
faster than Group 24N-24R (p < .05). 
lt is clear, however, that this differ- 
ence was due to the forgetting ex- 
hibited by Group 24N-24R and re- 
flected in the first few trials of the day. 
(c) Group 48R, which was given 48 
trials, all rewarded, was considerably 
faster (p «.001) at the end of 
acquisition than either of Groups 24N- 
24R or 24R. It was, however, slower 
than Group 50% (p < .001). This 
latter finding is consistent with the 
results of several investigators (see 
Spence, 1960). 

Extinction curves for the four 


‘groups are shown in Fig. 2. The 


vertical lines represent a 24-hr. in- 
terval between trials. It is of interest 
to note how the speed of Group 24N- 
24R, whose first 24 runway trials were 
not rewarded, dips after a 24.hr. 
interval as it did in acquisition. Per- 
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TRIALS 


à (The vertical lines 
represent a 24-hr. interval between trials.) 
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RUNNING SPEED (FT/SEC) 


Fic. 2. 


haps more important is the increase 
in speed from the first to the second 
trial of a day even though the first 
trial was not rewarded. This pattern 
is clearly different from that of the 
other three groups which all showed 
marked spontaneous recovery on the 
first trial of each day and then de- 
creased consistently. Because of these 
sharp and interesting differences in 
the pattern of running within a day, 
comparisons were made of speed on 
both the first and the last trials of each 
extinction day, and also im terms of 
trials to an extinction criterion. 
A Groups X Days analysis of vari- 
1 on daily first-trial 


ance was performe 
n. As would be 


speeds in extinction. / 
expected, the overall effect of groups 


was significant, F (3,56) = 289.95, 
p < .001, as was the decrease 1n speed 
from day to day, F (3, 168) = 331.29, 


Mean running speed for each extinction trial. 
represent à 24-hr. interval between trials.) 


t2 
N 


(The vertical lines 


p < .001. In addition, the inter- 
action of days with groups was signifi- 
cant, F (9, 168) = 52.33, p < 001. 
This interaction undoubtedly reflected 
the effect of the truncated distribution 
of speeds in extinction, but the lack of 
a speed change from day to day within 
Group 24N-24R contrasted with the 
consistent drop of the other groups, 
and was also a component of this 
interaction. Itis clear in Fig. 2 that 
first-trial speed of this group was 
essentially constant throughout ex- 
tinction. Individual comparisons of 
the overall group means by Duncan's 
multiple-range test showed that all 
differences were significant at the .001 
level, with an ordering from fast to 
slow speeds of: 50%, 48R, 24R, 
24N-24R. 

As can be seen in Fig. 2, the rela- 
tionships of the groups changed rather 
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rapidly within a day after the first 
trial. In particular, Groups 24R and 
48R that had never previously re- 
ceived nonrewarded trials dropped 
below the other groups by the end of 
the day. In terms of speed on the 
last trial of each day, a Groups 
X Days analysis of variance revealed 
that the effects of groups, F (3, 56) 
= 2418, p « .001; days, F (3,168) 
= 11.45, p < .001; and their inter- 
action, F (9, 168) = 3.34, 5 « .001, 
were all significant. Comparison of 
speeds on the last trial of each ex- 
tinction day summed over days (Dun- 
can's test) showed that the mean 
speed of Group 50% exceeded that of 
Group 24N-24R (p < .001), and that 
both of these were faster than Groups 
24R and 48R (p < .001). There was 
no difference between the latter two 
groups. These same relationships 
prevailed with a post-hoc trials-to- 
criterion measure. A criterion of two 
successive “slow” trials with travers- 
ing time greater than 20 Sec. was 
employed since many Ss in Groups 
50% and 24N-24R never attained a 
more stringent criteria. Employing 
Duncan's multiple-range test, it was 
found that Group 5095 took more 
trials to extinction than Group 24N- 
24R (p < .05), which in turn took 
more (p —.01) than the other two 
£roups, which did not differ, 

Finally, the number of trials on 
which an 5 did not enter the goal box 
within the maximum allotted time of 
2 min. should be mentioned. Here 
again the same relationships prevailed. 
Every S within Groups 24R and 48R 
made at least one such nonrun, but 
only 4 of the 15 Ss in Group 24N-24R 


and no Ss in Group 50% made any 
nonruns. 


Thus, the two major sets of facts ob- 
tained with this experiment concern the 
effect of initial nonrewarded trials on 
(Fact 1) daily first-trial running speed 


after the introduction of reward, and 
(Fact 2) running speed in subsequent 
extinction. These will be discussed in 
order. 


1. Hungry Ss, whose initial runway 
experiences did not include food reward, 
ran at their initial speed (relatively slow) 
early on each day even after reward was 
introduced. This regression to earlier 
speeds was quite transient within days 
and disappeared after the first few trials. 
Between days, however, this regression 
persisted through 3 days of rewarded 
trials, and was still evident during the 
first 2 extinction days. It is suggested 
that this phenomenon be viewed as 
greater forgetting between days by 
Group 24N-24R, relative to the other 
£roups, due to the recovery of habits 
established during the initial nonre- 
warded trials. The recovery of the 
slower Tesponse that had been appro- 
priate on earlier trials interfered with the 
faster speed of running which was more 
consistent with the later current reward. 
Thus, the reduced speed by Group 24N- 
24R on the first trial of each day after 
reward was introduced is viewed within 
the context of the traditional proactive 
interference paradigm. This interpreta- 
tion is consistent with the results of 
Collier, Knarr, and Marx (1961) who 
studied the effects on running speed of 
shifts in Sucrose-concentration reward. 
These writers (Collier et al, 1961) note 
the suggestion from their data that, 
“. . . The rat starts each day at a level 
commensurate with the previous rein- 
forcement [p. 494].” Hill and Spear 
(1963) have also reported this phe- 
nomenon. 

2. Nonrewarded trials, which were 
given prior to a series of trials in which 
food reward was presented for the first 
time, had the effect of increasing asymp- 
totic extinction speed and of extending 
the number of trials to a criterion of 
extinction. The importance of the 
Patterning of the nonrewarded trials was 
also evident, since even greater asymp- 
totic extinction speed and more trials to 
an extinction criterion were found when 
rewarded trials were interspersed among 
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the nonrewarded trials. It should be 
clear that the conclusion regarding 
asymptotic speed depends upon ex- 
clusive consideration of trials occurring 
late in an extinction day. Furthermore, 
it is not inconceivable that the differences 
at asymptotic extinction might have 
disappeared with more extinction trials. 
Nevertheless, the trials-to-criterion data 
and the shifts of group ordering from 
acquisition to extinction seem to com- 
pensate for such measurement problems. 

The apparently increased resistance 
to extinction of Group 24N-24R would 
appear to create interpretation problems 
for certain theories of the partial-rein- 
forcement extinction effect. For ex- 
ample, Amsel (1958) clearly requires that 
S must have built up a substantial ex- 
pectancy (i.e. sufficiently strong ro-s) 
before experiencing the nonreward in 
istance to extinction is 


acquisition, if resis 
to be increased. Lawrence and Fest- 


inger's view, though not completely un- 
ambiguous, apparently does not make 
this requirement (Lawrence & Festinger, 
1962, pp. 113-115). However, Lawrence 
and Festinger's concepts would need to 
be extended to account for the superior 
extinction performance of Group 50%. 
There appears to be no obvious satis- 
factory explanation for the effect of the 
initial nonrewarded trials on extinction 
behavior. It is likely that such an ex- 
planation will be found only after more 
is known about the precise nature of FTE 
behavior during the initial nonrewarded 
trials. Perhaps it is not proper to con- 
sider the running behavior of Group 


24N-24R within the same context of 

resistance to extinction” as is employed 

elsewhere. g 
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INFLUENCE OF RESPONSE DISCRIMINABILITY 


ON STIMULUS DISCRIMINABILITY ! 


J. F. RICHARD 


University of Rennes, France 


2 experiments tested the following hypothesis: if 2 stimuli are equivalent 
if they are followed by the same event, or by 2 physically different but 
not discriminable events, we can expect that the differentiation of a set 
of stimuli will be poorer if they are associated with events (or responses) 


of low discriminability. Experiment I used response cards showing 
varying number of dots, in high to low d 


associated with circles of varying diameter. 
rectangles of varying height were associ 
Results of these experiments led to the co 
bility of a set of stimuli depends on the dis 
with which they have been previously 


Several studies have shown that 
two stimuli become less discriminable 
if they are associated with the same 
response; this effect is called acquired 
equivalence of cues (De Riviera, 1959; 
Dollard & Miller, 1950; Grice & 
Davis, 1958, 1960; Jeffrey, 1953; 
Kurtz, 1955; Lacey, 1961). This prin- 
ciple may be applied when responses 
are not easily discriminable. We may 
assume that in this case S will not 
perceive as many responses as there 
are in fact. For instance, if there are 
eight different responses, S$ may per- 
ceive only five or six different ones. 
Let us suppose that eight stimuli are 
to be associated with these eight 
responses; from S's point of view it is 
as if these stimuli were to be associated 
with five or six responses. We then 
have a situation in which a set of 
stimuli is associated with a smaller set 
of responses, the typical situation for 


l'This paper is a part of a "Doctorat de 
Troisiéme Cycle" thesis (Richard, 1963) 
presented by the author at the University of 
Paris, May 1963. This research has been 
conducted under the supervision of P. Fraisse, 
to whom we are grateful for his helpful com- 
ments. The experiments have been conducted 
at the Laboratoire de Psychologie of the U 


ni- 
versity of Rennes, 
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liscriminability condition, 
In Experiment II, 
ated with motor responses. 
nclusion that the discrimina- 


criminability of the responses 
associated. 


acquired equivalence of cues. There- 
fore, we can expect that decreasing 
response discriminability will have the 
same effect as reducing the number of 
responses—that is, a decrease of 
stimulus discriminability. Such a 
result would mean that stimulus dis- 
criminability is dependent on response 
discriminability. Two experiments 
have been conducted to test this 
hypothesis. 


EXPERIMENT I 
Method 


Materials.—The stimuli 
black circles on white cards. 
circles varied from 40 to 6 
arithmic progression, 
sets of dots were drawn at random in a matrix 
of 64 cells, Cards with 8, 11, 14, 18, 24, 32, 
42, and 55 dots were used in the high dis- 
criminability response condition, and cards 
with 20, 22, 24, 26, 29, 32, 35, and 38 dots 
were used in the low discriminability response 
condition. 

Stimulus and response cards were presented 
by hand through an opening in the center of 
à board. Presentation time was about 10 sec. 

In both conditions circles of increasing size 
were associated with response cards with 
decreasing numbers of dots in perfect cor- 
relation. 

Procedure.— he Ss were told that they 
Were going to see circles of different size 


used were eight 
Diameters of the 
4 mm. in a log- 
On the response cards 
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followed by cards with varying numbers of 
dots on them. ‘The circles were supposed to 
represent marksmen and the cards the target 
at which they had fired. The Ss had to judge 
the marksmen’s skill from the number of dots 
on the target, each dot representing à success- 
ful shot. The Ss also had to rate the marks- 
men by skill. 

The experiment began by presenting each 
circle twice at the same time as the respective 
target in a random order. Afterwards the 
circle was presented first and S wrote his 
response, then the target was presented. The 
experiment was continued for eight trials, each 
stimulus and each response being presented 
eight times. At the end of the experiment Ss 
were asked how many circles they thought 


there had been. 
The experiment was carried out in a group. 


The Ss were college students, 16 in each 
condition. 


Results 


We shall analyze three types of 
data: (a) judgments about number of 
circles used in the experiment ; (b) 
number of different categories used to 
indicate the position of the marksmen ; 
(c) transmitted information as à 
measure of stimulus discriminability. 

The median of the distribution of 
the estimated number of circles is 6.8 
in the high discriminability response 
condition and 5.8 in the low dis- 
criminability response condition. A 
median test shows this difference to 
be significant beyond the .05 level. 
The Ss in the low discriminability 
response condition have thus per- 
ceived on average a circle less than 
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those in the high discriminability 
response condition. 

For the number of categories used 
to rate the marksmen, the correspond- 
ing values are 7.0 and 6.5. This 
difference is consistent with the previ- 
ous one but is not statistically 
significant. 

We have now to see if stimulus dis- 
criminability is lower when responses 
are weakly discriminable. We have 
computed transmitted information for 
each set of Ss which used the same 
number of categories in rating the 
marksmen in each condition. Since 
the number of dots on the cards is 
perfectly correlated with the size of 
the circles, the task of indicating the 
marksmen's position as regards their 
skill is in fact a task of identification 
of the stimuli. Some Ss were dropped 
because too few had used the same 
number of categories. The data of the 
last four trials have been used for 
this computation. As is shown by 
Table 1, transmitted information is 
higher in the high discriminability re- 
sponse condition than in the low dis- 
criminability response condition. As 
the value obtained for each category 
provides an independent estimate of 
transmitted information, the proba- 
bility of such an arrangement of the 
transmitted values in the two condi- 
tions may be calculated as follows: if 
we call A each estimate of T in the 
high response discriminability condi- 


TABLE 1 
TRANSMITTED INFORMATION FOR EacH CONDITION 
No of Categories | No, of Ss porro Mir ae 
High discrimin- 6 $ im 187 
a 7 5 x 
ability response 2 1198 
cond. 8 5 
M coe 3 : mo 1.42 
ability response ? à 
a ee 7 3 1.38 
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tion and B each estimate of T in the 
low response discriminability condi- 
tion, the probability of getting in a 
random sampling the arrangement 
AAABBB of the values in order of 
magnitude is 3!3!/6! = .05. Stimulus 
discriminability is then lower when 
stimuli have been associated with 
poorly discriminable responses than 
when they have been associated with 
highly discriminable ones. 


EXPERIMENT II 


In the preceding experiment .Ss 
were informed visually on stimulus- 
response assignment. [n many cases, 
however, this information is depend- 


ent on response production and is 
given by kinesthetic sensitivity. In 
Exp. II response discriminability is 
varied by the pattern of motor re- 


Sponse required and, therefore, by the 
pattern of kinesthetic feedback. 


Method. 


Apparatus.—The stimuli were eight rec- 
tangles20 mm. wide and varying in height from 
40 to 68 mm. in equal steps of 4 mm. They 
were presented by slides by means of an 
automatic projector. Presentation time was 
7.5 sec. 

The response display was adapted from 
Shepard's (1958) experiment. Eight 3-mm. 
metal contacts were arranged in a row along 
the bottom of a box. The S was required to 
insert an electric stylus into a narrow rec- 
tangular slot in the top of the box. Contacts 
were inserted in the bottom of the box in such 
a way that the Spaces between the contacts 
were at the same level as the contacts them- 
selves, and S was unable to distinguish 
individual contacts by tactual cues, Contacts 
were 12 mm. apart in the high discriminability 
response condition and 4 mm, apart in the low 
discriminability response condition, 
was arranged in front o 

isplay, particularly 
the slot and the stylus. "There were, therefore, 


no visual cues available for the control of the 
response. 


On each presentation .S 
the correct contact; a red 1 
the box informed S when 


had to try to find 
amp in the top of 
he had succeeded. 
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Each rectangle was associated with a contact ; 
contacts from left to right corresponded 
with rectangles of, respectively, decreasing 
size. The E closed the circuit for the required 
contact when the latter was touched by the 
stylus; when each rectangle was presented E 
placed the appropriate switch at the contact 
position. 

Procedure.—The Ss were told they were 
going to see different rectangles and had to 
find the position which corresponded to each 
of them at the bottom of the box; they were 
asked to remember the position so that they 
might find it again when the rectangle was 
presented a second time. 

During the learning period each stimulus 
was presented 12 times in a random order. 
This was followed by an identification task 
using the absolute judgment method, but S 
had to choose himself the number of cate- 
gories to be used. Each stimulus 
presented three times. 

Beside the high discriminability response 
condition and the low discriminability re- 
sponse condition a control condition was 
added, in which Ss were given only the identi- 
fication task without preliminary training. 

The Ss were 37 manual workers, 13 in 


each motor condition and 11 in the control 
condition. 


was 


Results 


We shall consider at first the num- 
ber of categories used in the identi- 
fication task, which is 
of the number of differ 
each S has perceived. The median of 
the distribution is 6.0 in the control 
condition, 5.6 in the high discrimina- 
bility response condition, and 4.3 in 
the low discriminability response con- 
dition. The difference between the 
two latter conditions, as tested by the 
Mann-Whitney U test, is significant 
beyond the .05 level, 

Transmitted information has been 
computed for the groups of Ss having 
used the same scale of judgment in 
each condition: these values an 
presented in Table 2 as well as the 


mean weighted index for eac 
tion. 


an estimation 
ent rectangles 


h condi 
The values for the high dis- 


criminability response cond 


i 1 ition are 
consistently higher th 


an those for the 
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TABLE 2 
TRANSMITTED INFORMATION FOR EACH CONDITION 


No. of Categori i 
No of papi | No.of Trenes Weighted 
Control 5 2 1.45 
cond. ó 3 138 146 
7 3 1.54 
High discrimi 
gn a n- 4 4 
ability response 5 2 13 3 
cond. 6 4 1.36 yas 
Low discrimin- » 3 2 0.93 
ability response 4 3 1.13 1.09 
cond, 5 4 1.14 l 


low discriminability response condi- 
tion. The probability of getting such 
an arrangement of the values in a 
random sorting, calculated as above 
is .05. We may then say that the 
discriminability of the rectangles has 
been lowered by their association with 
weakly discriminable motor responses. 


DISCUSSION 


The results of these experiments are 
consistent. Stimuli are less well dis- 
criminated when they have been associ- 
ated with weakly discriminable re- 
sponses. This is a confirmation of our 
hypothesis: decreasing response discrim- 
inability has the same effect as reducing 
the number of responses associated with 
stimuli and consequently discriminability 
is dependent on response discriminability. 

While many experiments have in- 
vestigated the effect of mediated equiv- 
alence of cues by having the same 
response associated to several stimuli, 
very few studies have used similar in- 
stead of identical responses. We know 
only two studies in which this paradigm 
was used, one by Dietze (1955) and one 
by Norcross (1958). In Dietze s experi- 
‘ment children had to learn either similar 
/BEEM, MEEM, PEEM) or dissimilar (yop, 
DAF, MEEP) words in response to shapes. 
The results show that shapes are easier 
to discriminate when they have been 
associated with dissimilar words than 
when they have been associated with 


similar ones. In Norcross’ study chil- 
dren had to pair two similar (zm, ZAM) 
and two dissimilar (wuG, Kos) nonsense 
syllables with four photographs and were 
then required to press one of four keys in 
response to each stimulus. More mis- 
takes were made for the pair of stimuli 
with which similar names had been 
associated than for the other. These 
results are consistent with ours: response 
similarity tends to increase stimulus 
similarity while response dissimilarity 
has the opposite effect. Le Ny (1959) 
has reported a similar effect: six tones of 
increasing pitch and equally spaced in 
jnd’s were associated with six letters 
(A, B, C, D, E, and F, respectively, for 
the control group and A, B, C, U, X, Z 
for the experimental group). The Ss 
had then to rate the difference in pitch 
between each tone: the third and the 
fourth tone (in increasing order of pitch) 
were rated more distant by the experi- 
mental group (for which C and U had 
been associated with these tones) than 
by the control group (for which C and D 
had been associated with these tones). 
The distance between stimuli may then 
be modified by the distance between 
responses. 

From these results it is possible to 
predict that if stimulus discriminability 
is dependent on response discrimina- 
bility, then reducing stimulus similarity 
would have little effect when response 
similarity is high. It would be more 
beneficial to reduce response similarity 


its 
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rather than stimulus similarity. In the 
same way, if stimulus similarity and 
response similarity are high, a greater 
facilitation effect would be expected from 
response predifferentiation than from 
stimulus predifferentiation. 
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solution may be consid 


Anagram 
n an algorithmic fashion 


permutes ir 
Ss may find it possible to 
attempting these permutations is 
word. 20 ad 
20 X 20 Latin square. 
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"Thorndike-Lorge (1944) count 
were “high” ruleout, i.e., perm 
120 permutations and 5 were 
of 70-76 of the permutations. 
were solved significantly more raj 
levels. 


“low 


Solution word frequency has been 
s a potent variable in 
lution times 
1958, 1959). 


established a 
predicting anagram so 
(Mayzner & Tresselt, 
However, even within a class of words 
of a designated frequency of oc- 
currence considerable between-words 

| variability in solution time remains. 
A number of hypotheses have been 
advanced to explain this variability ; 
among these hypotheses are: letter 
order of the anagrams, transition 
probabilities (bigram frequencies) of 
the anagrams, and transition proba- 
bilities of the solution words. 

Letter order has proved useful in 
predicting anagram solution (Hunter, 
1959; Mayzner & Tresselt, 1958); 
however, no objective way of classify- 
ing the various letter orders possible 
appears to exist. In the studies cited, 
letter orders were classified as “easy” 
or “hard” on the basis of inspection 
of the similarity or dissimilarity of the 
anagram letter order to the solution- 


upported in part by 
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Anagrams with “high” ruleout totals 
pidly (p € - 


01) at both frequency 


Erlebacher (1962) 
used Kendall's tau to classify system- 
atically the possible rearrangements 
of five-letter words. Her conclusions 
(p. 49) suggest that the similarity of 
the anagram letter order to the 
solution-word order is not relevant to 
speed of solution. 

Transition probabilities based on 
bigram frequencies (Underwood & 
Schulz, 1960) are not independent of 
word frequency, since they reflect the 
occurrence of various pairs of letters 
(bigrams) which are most commonly 
used in the English language. As a 
consequence, introduction of transi- 
tion probabilities as a variable may be 
expected to yield contradictory results 
(Beilin & Horn, 1962; Mayzner & 
Tresselt, 1959). 


At its simplest level, solution of a 
five-letter anagram by S may be 
conceived of as a simple problem- 
solving task which consists of trial and 
error manipulation of the five letters 
to form a five-letter word. Since 
there are 120 (5!) possible permuta- 
tions of the five letters, an algorithmic 
approach would inevitably result in 
However, solution attempts 


word letter order. 


success. 
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of this type quickly result in the 
discovery that certain bigrams and 
trigrams hypothesized as the beginning 
of the solution word simply do not 
appear in English words. With the 
elimination of each of these letter 
sequences (and the possible permuta- 
tions of the remaining letters) a large 
number of the 120 possible combina- 
tions may be ruled out as impossible 
or highly improbable solutions. 

To illustrate, consider the anagram 
HIGTL. Among these letters the 
following bigrams and trigrams simply 
do not occur as the beginning sequence 
of letters in English words: HG, nr, 
HL, GT, TG, TL, LH, LG, LT, GHT, GHL, 
GLH, GLT, THL, and THG. Each bigram 
rules out six possible orders and each 
trigram rules out two orders. Thus 
logical analysis rules out 54 + 12 = 66 
of the 120 possible permutations. 

Rarely do five-letter words begin 
with a vowel. A count at two levels 
of word frequency (five-letter words 
in the first thousand and seventh 
thousand of the Thorndike-Lorge, 
1944, count) revealed that less than 
10% of the five-letter words in these 
2,000 words begin with a vowel. If 1 
then, beginning a word with a vowel 
is also ruled out, for the anagram 
HIGTL, 24 additional permutations can 
be eliminated, thus bringing the final 
total ruled out to 90 (66 + 24) of the 
120 permutations possible. 

Mayzner and Tresselt (1958, 1959, 
1962) suggest that anagram solution 
is an S-R mediational type of problem 
solving. In their model the anagram 
evokes a "variety of implicit proc- 
esses" from S which relate letter order, 
word frequency, etc., in such a way as 
to produce solution in varying lengths 
of time. The present study, while not 
inconsistent with the mediational ap- 
proach, suggests the possibility of 
looking at these presumed implicit 
processes from the standpoint of S's 
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intuitive acquaintanceship with the 
structural properties of the English 
language. Thus, S, on exposure to an 
anagram, chooses for a provisional 
try two or three letters which, from 
his prior acquaintance with English 
words, can be expected to lead to an 
English word. Since Miller (1951) 
indicates that over 55% of words in 
the English language follow the form 
consonant-vowel (cv) or consonant- 
consonant-vowel (ccv), S is likely, 
following the present model, to choose 
as initial letters either cv or cc (to be 
followed by a vowel). This may ex- 
plain Hunter's (1959) finding that 
vowel-beginning words form much 
more difficult anagrams than do con- 
sonant-beginning words. 

Following the provisional try, .S 
either succeeds (solves the anagram) 
or rejects the letter sequence and 
chooses another set of letters and re- 
peats the process. From this line of 
reasoning it is clear, then, that 
anagrams having a relatively small 
number of two- or three-letter com- 
binations (high ruleout) which po- 
tentially yield English words when the 
remaining letters are attached should 
be more rapidly solved, “on the aver- 
age," than anagrams with 
possible initial bigrams or 
(low ruleout). 

While it is clear that the algorithmic 
Process mentioned above is not carried 
out by S, it seems reasonable to as- 
sume that S may be performing a 
similar task on a more intuitive basis; 
consequently, the purpose of this 
experiment is to test the hypothesis 
that words which have a high ruleout 


total will be more quickly solved than 
those with a low ruleout total. 


many 
trigrams 


METHOD 


Subjects —Twenty graduate student volun- 
teers from Indiana University served as Ss. 
Twelve of the 20 were females, 


eo 
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TABLE 1 


ANAGRAM SOLUTION TIMES (IN Sec.) FOR INDIVIDUAL ANAGRAMS AND FOR 
Four Coxprrioxs or WORD FREQUENCY AND RULEOUT 


Mean 
Cond. Solution | anagram Mdn. Range Ma AE 3D Anagrams 
Word XE X | “Hor 
Block 
High Frequency, | LIGHT ITLGH 4.0 1-36 
High Ruleout MONTH | NMTHO 6.5 1-180 
(90-94) TNDSA 17.5 | 5-180 9 32 16.9 18.2 
M =92 DHCIL 6.5 1-147 = 
LNTPA 12.5 1-180 
High Frequency, wreat | 100.0 | 19-180 
Low Ruleout HULAG 28.0 1-180 
(72-76) HIETW 45.0 | 5-180 44 77 22:5 15.0 
M = 7+ PRICE ICREP 131.0 2-180 
THEIR RTHIE 17.0 2-180 
Low Frequency, WRING 20.0 5-180 
High Ruleout 8.0 1-180 
(90-94) AD 180.0 2-180 26 68 23.1 15.0 
M = 92 RAPCM 25.0 3-180 
BLUNT | BNTUL 50.0 2-180 
Low Frequency, | CHAOS OCHSA 180.0 | 10-180 
Low Ruleout pOUCH | HPCOU 22.0 2-180 
(70-76) DEITY ITDYE 180.0 | 25-180 69 99 24.0 12.0 
M 2714 SCOUR | OCURS 71.5 4-180 
BROIL OLBRI 36.5 3-180 


Stimulus material.—Vhe stimulus material 
consisted of 20 five-letter anagrams typed in 
capital letters on white 3 X 5 cards. The 20 
words chosen for examination were selected 
from the Thorndike-Lorge (1944) word list 
at two frequency levels: 10 words from the 
first 500 words (occurring more than 100 
times per million words) and 10 from the 
seventh thousand in frequency (occurring 
eight or nine times per million words). 

Examination of “ruleout”’ totals for 75 
five-letter words revealed a range from 66 to 94, 
with an M = 82 and an SD equal to 6.1. At 
each frequency level then, five words were 
chosen with “high ruleout" totals (90-94) 
and five with “low ruleout" totals (70-76). 
All anagrams were presented in “hard” 
letter arrangements. That is, an attempt was 
made by the writer to present the anagrams 
in orders as dissimilar to the solution word as 
possible. Because of the interest in initial 
bigrams and trigrams, transition probabilities 
of solution words and anagrams were not 
controlled. The words and anagrams are 
presented in Table 1. . 

Procedure.—The anagrams Were placed in a 
20 X 20 Latin square to control sequence and 


practice effects. All Ss were tested individ- 
ually. Each was told that a five-letter 
anagram is five scrambled letters which, 
properly rearranged, will form one five-letter 
English word. They were also informed that 
they would have a maximum of 180 sec. to 
solve, and that if no solution occurred in that 
time, the correct solution would be provided 
before the next anagram was presented. No 
example was provided. Solutions were given 
verbally by S and times were recorded by 
means of a stopwatch. 


RESULTS AND DISCUSSION 


Available for analysis were 20 
scores for each S. Because of the 
skewed distribution of the scores and 
the truncation imposed by the ceiling 
of 180 sec. per word, median time 
scores were selected as best represent- 
ing each S's performance. As a con- 
sequence, nonparametric statistical 
methods were used throughout the 
analyses. 
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As pointed out above, word fre- 

quency is a potent variable in predict- 
ing anagram solution time. Yet if the 
*ruleout" hypothesis has validity, 
high-frequency words with low ruleout 
totals should require considerable 
information processing before solution 
is reached; conversely, low-frequency 
words with high ruleout totals should 
require considerably less information 
processing and would consequently 
result in more rapid solution. As a 
result of this reasoning, E predicted 
the following order for the four groups 
of words used (arranged in order from 
fastest to slowest time to solution): 
(a) high frequency, high ruleout; (b) 
low frequency, high ruleout; (c) high 
frequency, low ruleout; (d) low fre- 
quency, low ruleout. 

Page (1963) has developed a sta- 
tistic (L) as a test of the null hy- 
pothesis against an ordered alter- 
native. Thus, if the null hypothesis 
is expressed by Ho:m, = m. = ms 
= ma, then the ordered alternative 
here is Hi:mi < ms < ms < my. 

For purposes of this analysis the 
median of the scores for each block of 
five words for each S was obtained. 
These medians were then ranked, pro- 
viding a matrix of rankings for the 20 
Ss. In the five cases of ties, a coin was 
tossed to assign the appropriate 
ranking, 

The resulting L value (567) exceeds 
the critical value at the .001 level. 
Thus it may be inferred that signifi- 
cant agreement exists between the 
predicted order of anagram difficulty 
and the orders of solution times by Ss. 

The major concern of the present 
paper is the examination of the effects 
of high or low “ruleout” on anagram 
solution time. Figure 1 graphically 
illustrates the effects of *ruleout" on 
both high- and low-frequency words. 
Since the two lines are essentially 
parallel, it is possible to infer that no 
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Median time to solution for all 
words for all Ss under each of the four condi- 
tions of frequency and “ruleout.”” 


interaction occurs between frequency 
and “ruleout.”” 

At each word frequency level the 
sign test (Siegel, 1956) may be used to 
test the following hypothesis: 


Ho:P >4 
Hi:P <4 
N = 20 
When 
P —probabilty that Mduno— Mdyno 
is positive, 
Where 


Mdgno — median solution time for high 
"ruleout" anagrams. 
Mdyxo — median solution time for low 
"ruleout" anagrams. 


For both high- and low-frequency 
words, anagrams with high ruleout 
were solved significantly more quickly 
than those with low ruleout (b < .01 
in each case). 

Five-letter anagrams constructed 
from high-frequency words are, as pre- 
vious findings suggest, more quickly 
solved than those constructed from 
low-frequency words, However, the 
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present study suggests that for ana- 
grams constructed from these low- 
frequency words, presence of relatively 
few initial bigrams which can logically 
be formed into English words when 
combined with the remaining letters 
results in more rapid solution than 
anagrams constructed from high-fre- 
quency words with many possible 
initial bigrams which will form English 
words. 
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Exposure duration and the number of times the word was flashed 
(trials) were varied independently in order to investigate the growth 
of the perception of a word. With duration constant, the probability 
of perceiving a word increased with exposure trials so that the word was 
quite clear and easily identified after a number of flashes, even if the 
first flash appeared blank. The function relating the probability of 


perceiving a word to the number of exposure trials could be specified 
knowing only the asymptote (maximum probability attainable) and 
the probability of perceiving the word on the first exposure. Despite 
this effect of repeated exposures, the probability of perceiving a word 
was always higher for a single flash at a given duration than for 2 or 
more flashes at shorter durations summing to the same total duration. 


Word-recognition thresholds have 
been studied extensively in the past 
decade with particular emphasis on 
such variables as frequency, recency, 
word length, set, meaning, and value. 
Typically, the ascending method of 
limits has been used to determine 
thresholds defined as the duration at 
which a word is correctly recognized. 

Little attention has been given to 
the detailed analysis of the perceptual 
processes involved in the growth of 
the perceptual experience of a word. 
From this point of view, the number 
of exposure trials becomes relevant 
as an independent variable and, along 
with exposure duration, more than an 
index of the effects of other variables. 

Since exposure trials and exposure 
duration have been confounded in 
previous studies of word recognition 
using the method of limits, the present 
experiment was undertaken to study 


! This research was partially supported by 
a grant from the National Institute of Mental 
Health (MH-03244) to the first author. We 
wish to thank Martha Breen for her assistance 
in the collection and analysis of the data and 
Michael Reed for his assistance in the mathe- 
matical analysis. 

2 Now at the University of Rochester, 

3 Now at the University of Wisconsin. 
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these variables independently. Spe- 
cifically, this study was directed at 
assessing the effects of repeated ex- 
posures of words upon S's ability to 
perceive them when exposure duration 
was invariant. Two questions guided 
the research: (a) Do repeated ex- 
posures of a word at a constant 
duration contribute to the growth of 
the phenomenal experience of the 
word? (b) If trials is a contributing 
variable, then in what way does it 
operate, and what is the relation be- 
tween trials and the well-documented 
effects of exposure duration? 
Moreover, with independent varia- 
tion of trials and duration, it should 
be possible to assess the relative 
contributions of these variables to the 
development of the percept. 


METHOD 


Stimuli—Each S was shown 504 seven- 
letter, three-syllable English words, selected 
from the Thorndike-Lorge (1944) word count 
and from the Webster's Unabridged Dictionary 
(1939). The most frequent two-thirds of the 
615 seven-letter, three-syllable words from 
the Thorndike-Lorge list (general summary 
count—G) were used. An additional 500 
words of the same structure were culled from 
Webster's. Judges, familiar with the differ- 
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ences between rare and nonrare words in the 
Thorndike-Lorge list, selected 94 words that 
were judged to be nonrare. Thus, the 504 
words represent a sample (virtually the 
population) of seven-letter, three-syllable 
words that appear not infrequently in print. 
The words were divided into nine groups 
of 56 words each (corresponding to nine 
experimental sessions) according to a random 


process. They were printed on a 6-in. wide 
paper tape, 


using a LeRoy lettering stencil, 
No. 3240-290CL and pen, No. 3233-4. The 
letters were % in. high and the words were 
2} in. long. The words were drawn so as to 
equalize the spaces between the seven letters. 
The vertical spacing between the words was 
lin. ‘The horizontal visual angle of the words 


was 24°. 

The words were presented singly in the 
rear of one channel of a three-channel mirror 
tachistoscope (Scientific Prototype Manu- 
facturing Corporation, Model D). A second, 
background, channel was always lighted, 
containing faint lines above and below the 
word which served as fixation boundaries. 
The stimulus durations were monitored be- 
fore and after the experiment and varied by 
less than -£0.1 msec. The reflectance meas- 
ured at the eyepiece with a Macbeth il- 
luminometer was 10 ft-L for the background 
and 18 ft-L for the stimulus plus background. 

Procedure.—Each word was assigned to 
one of seven duration values, ranging from 
5 to 35 msec, in steps of 5 msec., and one of 
eight exposure numbers (1, 2, 3, 4, 5, 10, 15, 
25), representing the number of times it would 
be exposed. Thus, for each session, one word 
was gned to each of the 56 Duration 
X Trial combinations. The assignment of 
words to these combinations was random for 
each session. 

Since no prior threshold measurements had 
been made for 5s, durations that were either 
too high or too low (in terms of lack of 
variability) were dropped after the first day. 
Thereafter, five duration values were used, 
although not necessarily the same ones for 
each S. "Therefore, comparison between Ss 
required that they be assigned relative values. 
The lowest of these five durations at which S 
perceived more than just an occasional word 
was defined as his threshold (T) duration. 
The other durations were designated as T — 5, 
T+5, T+10, T+ 15. Approximately 12 
words in each of the 5 X 8, Duration X Trial 
combinations were shown over the 9 days. 

The stimuli were exposed when S pressed 
a button to trigger the T-scope. Hence, S was 
always prepared for the flash. He always 
knew when a word was being changed, but he 
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never knew before an exposure whether that 
would be the last time he would see the word. 
The S was told that he would not know this 
and that he should treat each exposure as if 
it were the last one. Intertrial interval was 
never less than 8 sec. 

For each stimulus exposure, 5 was required 
to report verbally the letters that he was 
certain he perceived and their positions, even 
when he was certain of the word. This was 
given great stress in the instruction. All of 
the analyses reported are based on_ the 
identification of letters, not of words. The S 
was said to have perceived the word if he 
correctly identified all of the seven letters on 
any of the exposures. This criterion, using 
letters rather than words, maximizes the 
probability of reports based on what S saw, 
rather than on what he thought he saw, or 
what he was sure must have been presented, 
but did not actually see. 

Subjects.—Ten Yale undergraduates, en- 
rolled in the introductory psychology course, 
served 9 hr., 1 hr. per day, over a period of 
2 wk. They were tested individually. 


RESULTS 


'The data to be presented below can 
best be understood by first noting the 
phenomenal reports of Ss as they 
participated in the experiment. lf 
the duration of a word was low, S was 
usually unaware of letters or parts of 
letters on the first flash—the flash 
was blank. On the second or third 
flash, with no change in duration, 
beginnings of letters and sometimes 
whole letters would appear. After 
several more flashes, a number of 
letters would be present—often the 
whole word. 

The percept of the word that 
developed after repetition was in no 
sense fuzzy, hazy, or the product of a 
guess. It assumed very clear status, 
so that S was never uncertain about 
his report, even though he was unable 
to see anything a few exposures 
earlier. These reports were a dramatic 
demonstration that perceptual thresh- 
olds must have been a function of 
repetition as well as energy and 
duration of stimulation. 
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TABLE 1 


PERCENTAGE OF WORDS IN WHICH ALL LETTERS WERE CORRECTLY IDENTIFIED, 
rà A RUNGEION OF EXPOSURE DURATION AND NUMBER OF EXPOSURE TRIALS 


Trials | 
Average 

B 1 2 3 4 5 10 15 25 

0.9 0.0 1.0 0.0 1.9 1.9 2.9 1.1 
T min ES 19.2 25.6 38.7 33.9 51.7 58.2 62.3 | 37.1 
ti 5 38.7 71.3 74.6 88.6 84.3 93.3 92.6 96.7 80.6 
aT T 10 61.9 89.0 92.5 90.6 96.7 98.4 98.3 99.2 91.7 
T 4-15 80.9 93.9 97.5 100.0 96.4 97.5 98.8 100.0 95. 

Note.—Ten Ss per cell; an average of 12 different words in each cell, 
The data supporting these phe- given by 
nomenal impressions are given in Ps.-—1—g [1] 


Table 1 which presents the percent- 
ages of words perceived (Le., all 
letters correctly identified) as a func- 
tion of the duration at which the word 
was presented and the number of 
times it was exposed. It is clear that 
varying the number of exposure trials 
increased the likelihood of perceiving 
words. 

Two functions, based on different 
models, produced reasonably close 
fits to the data points for three dura- 
tions: T, T + 5, T +10. One func- 
tion is given by the expansion of the 
binomial (p, + qı)”, where pı is the 
probability of Perceiving a word on 
Trial 1, 91 = 1 — f, and n is the 
number of trials. This model is 
based on the assumption that the 
individual trial Probabilities are in- 
dependent of the number of trials, ie., 
the probability of Perceiving a word 
would be the same on Trial » + 1 as 
on Trial m. Further, this model 
predicts that trials Should have a 
cumulative effect on accuracy such 
that as S is given more trials, he 
should be able to perceive 4 greater 
proportion of words. Thus, while the 
probability of perceiving a word on a 
given trial is constant, the probability 
of perceiving a word by Trial n is 


which increases with n. This function 
is plotted in Fig. 1 in broken lines, 

The second function (fitted by the 
method of least squares, and plotted 
in Fig. 1 as solid lines) is 


Pa = A — B/n, [2] 


where A is the maximum probability 
of perceiving a word at a given 
duration (estimated on the basis of 25 
trials), Bisa constant, the meaning of 
which will be discussed below, and 7 is 
the number of trials. For the effects 
of trials on accuracy to be significant, 
the exponent of n (x) can not be zero. 
This exponent (respectively, 1.05, 
0.85, and 1.14) differed significantly 
from zero (p < 01) for each of the 
three durations, 

The asymptote of Function 2, or 
the maximum probability of a word 
being perceived for a given duration 
(in above terms), is given by 


A = lim Pa, 


5 -— © 


and may take any value between zero. 
and one depending upon the duration. 
However, the data indicate that 4 


will be 1.00 except when the duration 
of the stimuli is very short. 


PROBABILITY OF PERCEIVING A WORD 


Fic. 1. Probability of perceiving a W 


REPETITION AND GROWTH OF A PERCEPT 
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DURATION: T+10 
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The value of B was close to that of 
qı for all three durations and the 
exponents of z did not differ signifi- 
cantly from 1.00. The function 


Pa= A — qi/n [3] 


is plotted in dashed lines in Fig. 1 and 
closely approximates the empirically 
derived functions. Thus, for all but 
relatively short durations where 4 
=Æ 1.00, the value of P; can be calcu- 
lated from estimates of qı obtained 
from a list of words presented for one 
exposure per word. 

Within each of the three durations, 
the binomial expansion is generally 
below the data points (and the other 
functions) for the first few trials and 
above them thereafter. This implies 
that, for the first few trials, the 
probability of perceiving a word on 
a given trial is increasing from trial to 
trial. The converse seems to occur 
after about 10 trials, indicating that 
if Ss have not perceived the word by 
then, the chances of their doing so on 
any given trial decreases with addi- 
tional trials. 

The relative contributions of trials 
and duration may be assessed by 
determining the optimal distribution 
of any given total number of milli- 
seconds of exposure time. That is, 
given a total of, Say, 50 msec. of 
exposure time, does one flash of 50 
msec., or two flashes of 25 msec., or 
five flashes of 10 msec. maximally 
facilitate the veridical perception of 
the stimulus? Note that introducing 
trials as a variable while keeping total 
stimulus exposure time constant not 
only requires shorter exposures per 
trial as the number of trials increases, 
but also introduces a time interval 
between successive exposures. Table 
2 illustrates that one long flash was 
always better than two or more 
shorter ones summing to the same 
total duration. This suggests that 


TABLE 2 


PERCENTAGE OF WORDS PERCEIVED WHEN 
THE TOTAL NUMBER OF MILLISECONDS 
or Duration Is DIVIDED INTO 
ONE oR MORE Exposures 


Flash Duration 
Total 
Msec. 
10 15 20 25 30 
20 3:1 42.0 
30 5.1 26.1 74.1 
40 9.2 67.5 
50 6.0 85.1 
60 7.0 | 37.7 | 68.3 93.7 
75 35.6 91.4 
80 8.0 86.8 
90 &5 | 37.0 97.3 
100 9.1 81.7 | 91.2 
120 9.5 | 42.0 | 83.0 89.7 
150 10.1 | 47.4 95.5 | 96.3 


, Note.—The number of flashes in each cell is ob- 
tained by dividing the total number of milliseconds by 
the flash duration, Trial No. 6, 7, 8, 9, and 12 were 
approximated using Function 3. 


exposure duration was a more im- 
portant variable than the number of 
exposure trials. 

The relative contributions of dura- 
tion and trials may be further ana- 
lyzed by comparing the data of this 
experiment with threshold measure- 
ments obtained with more traditional 
methods where duration and exposure 
trials are confounded. For example, 
comparison can be made between the 
achieved accuracies after, say, the 
fourth flash at 25 msec., when either 
the first three flashes were also at 25 
msec. in duration (this experiment), 
or when the first three flashes were, 
Say, 10, 15, 20 msec. (traditional ex- 
periments). Figure 2 presents data 
from two such experiments: the new 
method curve is derived from results 
of the present experiment; the old 
method refers to the data collected 
on à comparable group of 10 Ss using 
exactly the same procedures, equip- 
ment, and values as in the present 
experiment, except that the durations 
increased 5 msec. with every trial, 
It is clear that except for the first 
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1.00 
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60 


——— TRADITIONAL METHOD 


40 
s... NEW METHOD 


.20 


PROBABILITY OF CORRECT IDENTIFICATION 


5 TRIALS 
30 DURATION (msec) 


[i 2 3 4 
10 15 20 25 
TRIALS x DURATION COMBINATIONS 


Fic. 2. Probability of correctly identifying 
a word as a function of method of combining 
trials and durations. 


point, where, of course, there should 
be no difference, the new method 
results in higher accuracy. Thus, 
holding trials constant, the higher the 
duration of the earlier trials, the 
greater the accuracy. 


DISCUSSION 


The data demonstrate that repeated 
exposures of a word at a constant dura- 
tion does contribute to the growth of the 
phenomenal experience of the word. The 
prediction of accuracy achieved after any 
number of trials requires knowledge only 
of the probability of perceiving a word 
on the first trial, and the maximum 
probability after many trials. This in- 
formation can be used in one of two 
different functions to predict accuracy: 
the binomial expansion based on the 
assumption that the individual trial 
probabilities are independent and con- 
stant from trial to trial, and the function 
P, = A — B/n, where A is the maxi- 
mum probability, B is the probability 
on the first trial, and ? 1$ the number of 
trials. 

The second function seems to fit the 
data points better. It differs from the 
binomial in that it falls above the 
binomial for the first few trials, and 
below it for the remaining ones. Since 
the binomial is an index of independence, 


it appears that information received and 
processed on one flash facilitates percep- 
tion of a word on the next flash for the 
first few trials. Thereafter, the effect 
appears to decay, and possibly becomes 
inhibitory in nature. This could be 
caused by S relaxing his effort after 
repeated failure to perceive the word, 
or comparing the information obtained 
with impossible or very unlikely words. 

The latter reasoning is consistent with 
Bruner’s (1957) early thinking about 
perceptual recognition. However, it is 
directly contrary to Bruner’s more recent 
work (Bruner, 1963), where he finds that 
stimuli first presented in an impoverished 
form (Bruner currently uses a procedure 
where the stimuli are out of focus) are 
more difficult to recognize. Bruner sug- 
gests that this occurs because S is at- 
tempting to fit the stimuli to a hypothesis 
that is most likely wrong (because the 
stimulus was impoverished) and there- 
fore takes extra trials to disconfirm that 
hypothesis before he can go on to check 
more correct ones. Applied to the 
present experiment, this reasoning would 
predict that the early trials, which are 
clearly impoverished, should fall below 
the binomial. That is, having been 
exposed to one impoverished flash, sub- 
sequent flashes should be harder to 
perceive. The data are contrary to this 
prediction. It is the early trials that are 
above, or at least equal to, the binomial, 
not below it. Further, this is most 
clearly the case for the threshold dura- 
tion, where the stimuli were most 
impoverished. 

A related problem concerns the failure 
of the threshold curve to approach 1.00. 
One explanation might be that the curve 
would approach unity if enough trials 
had been given. This appears unlikely 
from the shape of the curve. 

A second alternative is that certain 
rare or difficult words were never identi- 
fied when presented at threshold dura- 
tions. However, an examination of the 
words at threshold duration indicated no 
tendency for specific words to be missed 
more frequently than other words. 
Moreover, since the words were ordered 
randomly with respect to duration, trials, 
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and days, it is unlikely that specific word 
characteristics could have systematically 
affected the results. 

A third alternative warrants more de- 
tailed consideration. If the image of the 
stimulus from which S is working fades 
rapidly at very low durations so that S$ 
can not “scan” all of the letters, he may 
not be able to perceive them all before 
they fade. This interpretation is con- 
sistent with recent evidence (Averbach 
& Sperling, 1961) which suggests that 
short-term memory fades quite rapidly 
and allows S only a short time, perhaps 
much less than a second, to process the 
letters in the stimulus. If processing of 
a seven-letter word takes a given length 
of time, and the duration of the flash 
leads to a short-term memory shorter 
than that time, then S will never cor- 
rectly identify all of the letters. This 
interpretation suggests that the threshold 
duration in this experiment was probably 
very near the threshold for short-term 
memory, so that, with oscillation, some 
of the words were retained long enough 
to be identified while others were not. 
In examining the raw protocols, many 
words were found for which Ss saw all 
of the letters, but never on the same 
trial. This occurred a number of times, 
though it does not account for all the 
words which were never fully perceived. 
There were many words for which .S 
never saw all of the letters even on the 
different trials. 

If the short-term memory explanation 
of the failure to reach asymptote at 
unity is correct, then the eflects of 
duration on perception can be reinter- 
preted. Duration would now be seen as 


a discontinuous function such that words 
presented below a given value (allowing 
for oscillation) would never be perceived 
regardless of the number of trials given. 
This is not to say that duration is no 
more than a dichotomous variable with 
respect to its effects on perception. 
There is ample documentation that in- 
creasing the stimulus duration will in- 
crease the accuracy of perception. How- 
ever, this relationship leaves unanswered 
problems. For example, if the evidence 
regarding short-term memory is correct, 
why should variation of a few milli- 
seconds in duration of the stimulus affect 
accuracy when presumably the stimulus 
produced a short-term memory lasting 
hundreds of milliseconds? Much more 
research is needed before the relation of 


duration of stimulation to perception 
becomes clear. 
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discrimination, 


multiple-process model, based on strategy 


to fit the data. 


One branch of the theory of paired- 
associates (PA) learning, stemming 
from E. J. Gibson's (1940) work on 
stimulus differentiation, says that a 
PA may be learned through several 
processes. Underwood and others 
(Underwood, Runquist, & Schulz, 
1959: Underwood & Schulz, 1960) 
have extended the idea, and shown 
that along with stimulus discrimina- 
tion there are separable processes of 
response discrimination and response 
integration. McGuire (1961) has 
partitioned errors into those caused 
by confusions between stimuli, those 
caused by lack of response integration, 
and associative failures. If S gives 
the | response that belongs with a 
similar stimulus, then the error is 
attributable to stimulus confusion. 
If the response S makes is part of, or 
similar to, the correct answer, then 
the error is a failure of response 
integration or discrimination (Shep- 
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each of which was all-or- 
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associates list with 8 unique items and 8 confusable twinned 
as administered to 50 Ss by the anticipation method. Re- 
on nouns, highly integrated and discriminable, and 
Unique items were learned in an all-or-nothing 
re learned in 2 steps, association and stimulus 


nothing. A discontinuous 
-selection theory, was shown 


ard, 1957). Omission or any other 
error is probably caused by lack of the 
correct association. This analysis of 
errors gives a picture of the processes 
involved in learning a PA list. 

A second diverging branch of 
paired-associate theory begins with 
Rock (1957) followed by Estes (1959, 
1960), Estes, Hopkins, and Crothers, 
(1960), and Bower (1961). These 
authors assert that ordinary PA 
learning occurs as an all-or-nothing 
process, and show that if stimuli are 
easily discriminated, responses are 
discrete and well known to S, and 
training proceeds at a leisurely pace, 
all-or-nothing (discontinuous) data 
may be observed. 

Three main experimental methods 
have been used to test the all-or- 
nothing hypothesis; Rock's item- 
replacement method (Rock, 1957), 
Estes’ miniature experiment (Estes, 
1960; Estes et al, 1960), and the 
method of teaching a PA list by 
anticipation and then analyzing the 
data in detail (Bower, 1961; Bush & 
Mosteller, 1959). We employ the 
third method, using level of perform- 
ance before the last error, the learning 
curve, P (8n), the distribution of total 
errors (T), and the distribution of 
trial of last error ('), as summary 


descriptions of the data. 
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In Bower's all-or-none PA model .S 
with respect to a given item begins in 
State So. His responses in Sp are 
correct or wrong with probabilities 
pandg. On any trial with probability 
c he shifts to State S, on that item, 
and makes no more errors. 

Before the last error, performance 
is stationary with a probability 5 of 
giving a correct response. The learn- 
ing curve, probability of an error on 
Trial z, is given by 


P(&)-—g(1—2o*- [1] 


Distribution of Total Errors per Item 
(To): 


PU y= 0) = bp [2] 
P(To = k) = (1 — bp)b(1 — by 
for any k > 0. 


Here b is the probability of no more 
errors following a given error, and is 
c 
gt pc 


The distribution of total errors per 
item has mean 


E(To) = q/c = us 
and variance 
Var (To) = us + (1 — 2c)u*s 
Trial of Last Error (n'o) 


P(n'o = 0) = bp 
P(n') = k) = qb(1 — cya 


[3] 


for all k > 0. 
The mean trial of the last error is 


E(n') = mo = buo/c 
2 
Var (n'o) = Mo ( o = f mo). 


An older all-or-nothing learning 
theory (called the “noncontinuity” or 
"insight" theory) has been applied 
to discrimination learning (Lashley, 
1929). Several recent papers have 
revived this theory as a mathematical 


model (Bower & Trabasso, 1963; 
Restle, 1961, 1962; Trabasso, 1961, 
1963). Assume that S begins with 
some strategy on which he bases re- 
sponses. When a strategy fails, S 
resamples with replacement from his 
set of strategies. Suppose that strat- 
egies can be partitioned into irrelevant 
ones, which lead to success and failure 
with probabilities P and Q, and cor- 
rect strategies which always lead to 
success. Let S; be the presolution 
state of using an irrelevant strategy 
and Sz be the solution state of using 
a correct strategy. Let d be the 
proportion of correct strategies. The 
S will go from S; to S; only when his 
strategy fails, and then only with 
probability d. This model has the 
properties, 

Learning Curve: 


P(&) = Q1 — Qd)'(1 — d), [4] 
Distribution of Total Errors: 
P(T: = k) = (1 — dj'a, 
E(T) -(0 —4)d [5] 
Var(Ti)) = (1 — d)/d* 
Distribution of Trial of Last Error: 
P(w;-0)-d [6] 
Pi, =k) = (1 — Qd)'"Qd(1 — d) 
for all k > 0. 


Bas) = A= d)/Qd = mı 
Var (73) = mi(2 + Qd — m). 


The Bower association model and 
the noncontinuity theory of discrimi- 
nation learning can be combined into 
& more comprehensive theory of PA 
learning. To obtain all-or-none PA 
data, Es eliminate all Sources of 
difficulty except simple association. 
In particular, both stimulus dis- 
crimination and response integration 
are made very easy. However, PA 
experiments may involve several proc- 
esses other than association. To 
account for the results of such ex- 
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periments we propose a Discontinuous 
Multiprocess theory in which each 
process is an all-or-nothing event. 
This idea has been developed and 
given detailed support by analysis of 
some existing data (Restle, 1963). A 
more definite test is made herein by 
first showing the distinctive mathe- 
matical properties of a discontinuous 
two-process model, then performing 
ə detailed experimental test. 

The model is as follows: Initially, in 
State So, S does not know the correct 
response and guesses among the 
response alternatives, having a con- 
stant probability P of being correct 
and distributing his errors among 
various responses. With probability 
c on any trial he learns the response to 
that item. If the stimulus is unique, 
S has mastered the item. If there is a 
confusable twin stimulus, then 5 may 
enter an intermediate State Sr in 
which he knows the response but can- 
not differentiate between stimuli. He 
therefore, with probability Q, makes a 
stimulus confusion error. On any 
error S resamples and may, with 
probability d, choose a discriminating 
strategy and stop confusing the twin 
stimuli. At that time he enters State 
Sz and since he has both formed the 
association and the discrimination, he 
performs correctly. Both steps of 
learning can occur on the same trial. 
The arrangement of states is as shown 
in Fig. 1. 

The following statistical properties 
can be derived by elementary, though 


somewhat laborious methods: 
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Fic. 1. Arrangement of s 
two-process model. 


Learning Curve: 


e Je — 
P(&) =q(1—o + eC 


xEa-ag -a-2771 [7] 
Distribution of Total Errors: 


P(T =k) = P i 
x[(t-4ap)(-— d)” 


— (1 — bp) (1 — b)"] [8] 
E(T) = E(To) + E(T:) 
from Equations 2 and 5. 
Var(T) = Var(To) + Var (Ti). 


These results follow because total 
errors is the convolution (Feller, 1957) 
of the distributions of errors in States 
So and Sr. 

Trial of last error is dealt with in 
two parts. lf no errors are made in 
State Sr (with probability d), then the 
last error is made in So as given in 
Equation 3. If an error is made in Sr 
(probability 1 — d), then the trial of 
the last error is the convolution of the 
distribution of trials in State So (not 
trial of last error in that state) with 
the distribution of trial of last error 
in Sy. This gives Trial of Last Error: 


P(n! =k) =dbp fork =0 [9] 
= dgb(i — c) + eo 
[as aet aged 
for k > 0. 


E(n') = dmo + (1 — d) | : + 5] 


For mathematical details and sev- 
eral related applications see Restle 
(1963). 

This discontinuous two-process 
theory agrees in substance with Mc- 
Guire's position that PA learning may 
involve separate processes, and that 
the processes can be separated experi- 
mentally and also by analysis of 
errors. The learning curves of the 
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separate types of errors are fit well by 
both theories. 

The discontinuous theory is like 
McGuire's, but goes beyond. First, 
by defining specific states, it makes 
possible the use of ordinary discrete 
probability theory. Second, the dis- 
continuous theory generates not only 
mean group effects, but distributions 
of statistics. An extra advantage 
of the discontinuous multiple-process 
theory is that it gives a definition of 
an irreducible process. The analysis 
of PA learning into separate processes 
is terminated whenever all processes 
yield geometric distributions. This 
prevents an infinite regress of sub- 
divisions which might occur with con- 
tinuous models and a determined 
theorist. To test the discrete two- 
stage theory, we performed a direct 
experimental analysis of the effect of 
stimulus similarity in PA learning, 
to see if it produces a separate and 
unitary stage of learning. 


METHOD 


A mixed-list design was used. Every S 
learned a list composed half of PAs with 
unique stimuli and half of PAs with confus- 
able twinned stimuli. 


Subjects.—In fulfillment of course require- 
ments, 52 introductory psychology students 
served as Ss. Two failed to learn in 30 trials, 
were found not to have understood instruc- 
tions, and were discarded, leaving 50 Ss in 
the analysis. 

Apparatus and materials.—The Ss sat at a 
table across from E and were shown flash 
cards over the top of an 18-in. screen. Stimuli 
were symbols and outline pictures; all 16 are 
shown in Fig. 2. As can be seen there were 
8 unique and 8 twinned stimuli. Responses 
were the words COST, HOPE, PART, RUSH, and 
ONLY, which are among the most frequent 
1,000 words (Thorndike & Lorge, 1944). 
Responses were assigned randomly to stimuli, 
separately for each S, with the restriction that 
twinned stimuli never both had the same 


response. Also no two pair of twinned 
stimuli had the same pair of responses for a 
given S. 

Procedure.—Before beginning the PA t. 
S was taught the five responses by havin 
repeat them in an order said by E. 
different orders were given until all were 
recited perfectly, and then .S Was required to 
repeat the words in any order, All Ss 
succeeded, 


In instructions, S was informed that some 
stimuli would be v. 


ery similar to others; that 
each stimulus had one and only one correct 
response which would not change; and that 
each response would be the answer to more 
than one stimulus. 


The stimulus card w. 
sponded, after which 
answer orally. N 
E recorded the re 


ask 
g to 
Five 


as shown until S re- 
E gave the correct 
ext was a 3-sec. delay while 
sponse and selected the next 
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Fic. 2. Stimuli used in the experiment, showing twinned stimuli on the right. 
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stimulus. ‘Training continued with no inter- 
trial interval until S went twice through the 
list without error or completed 30 trials. 
Items were presented in a different permuta- 
tion for each trial for each S, with the re- 
strictions that twinned stimuli were not 
presented successively, and if a stimulus 
appeared in the fifteenth or sixteenth position 
on one trial it would not appear first or 
second on the following trial. Orders, subject 
to these restrictions, were generated ran- 
domly on an IBM 709 computer. The 
motivation behind most of these procedures 
is to produce all-or-nothing learning of the 
unique items. The use of short common 
words as responses, and slow flash-card train- 
ing with face-to-face contact of Sand E, were 
intended to replicate some of Bower's (1962) 
The use of five responses for 16 


technique. T 
stimuli, and the elaborate randomizing 
procedures, were intended to prevent efficient 
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guessing. Five responses rather than, say, 
only two, were used so that confusions of 
twin stimuli could be distinguished from 
mere guesses. 


RESULTS 


Items with twin confusable stimuli 
(twinned items) were much more 
difficult than items with unique stim- 
uli (unique items). Mean errors were 
5.96 and 2.80, respectively. Of the 
5.96 errors, 3.81 were stimulus-con- 
fusion and 2.15 were nonconfusion 
errors. 

Unique stimulus items.—These data 
closely approximate the all-or-nothing 
assumptions with p = .30, c = .25. 
Figure 3A shows the learning curve, 
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3B shows the distribution of total 
errors per item, 3C the distribution of 
trials of the last error per item. 
'Theory and data are very close, com- 
parable to Bower's (1961) results. 
There was some gradual improvement 
on trials before the last error, as 
shown by the backward learning 
curve in Fig. 3D. 

Twin stimulus items.—Of the five 
responses available to S, one is 
correct, one is the confusion response, 
and the other three are scored as non- 
confusion errors. Recall that on the 
average, Ss responded erroneously 
with probability 1 — p = .70 to the 


A) asa function of trials (learning curve); 
ution of trial of last nonconfusion error; 


unique items which had four wrong 
answers. Since just three of these 
answers are scored as nonconfusion 
errors for twinned stimulus items, the 
probability of a nonconfusion error in 
State So should be .70 x .75 = 2525. 
Theoretical predictions for ¢ = 25 
and p = 475, using the simple 
one-process model, were calculated. 
These are the appropriate parameters 
for twin items in State So, if we count 
only nonconfusion errors. Results are 
shown in Fig. 4, in which the four 
panels correspond to the four main 
statistical results. The predictions in 
Fig. 4 are made without any esti- 
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mated parameters; the value of c is 
from the unique items, and the 
predicted p depends upon a priori 
considerations and the results on 
unique items. Panel 4B, the dis- 
tribution of total nonconfusion errors, 
shows an almost perfect correspond- 
ence. Panels 4A and 4C show that 
the errors are made on somewhat 
later trials than is predicted. The 
backwards learning curve (4D) shows 
stationarity but at a surprisingly high 
level, a mean of .625 correct, as con- 
trasted with the calculated value of 
475. 

Now consider confusion and non- 
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confusion errors together. The mean 
total errors on twinned items is 5.96. 
Mean total errors on unique items 
was 2.80, and the difference, 3.16, is 
presumed to be the mean errors made 
in the intermediate state. By the 
strategy-selection theory (Restle, 
1961) one can estimate the probability 
of learning, on an error, at 1/(1 + 7) 
where 7 is mean errors. Taking ac- 
count of the fact that some items were 
not mastered in 30 trials, d = .225. 
Since exactly two items are twinned 
in the list, it is assumed that the inter- 
mediate stage is one of two-choice dis- 
crimination. We set Q = .50. From 
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the unique items, ¢ = .25, p = .30, 
and b = c/(q + pc) = .33. These 
values were inserted in Equations 7, 
8, and 9 to compute predictions of 
total errors on twinned items. The 
calculations are compared with ob- 
servations in Fig. 5. The theoretical 
curves in Fig. 5A, 5B, and 5C are 
calculated from Equations 7, 8, and 
9.2 The agreement is close, showing 
that total errors could arise from a 
two-stage process. A one-stage model 
will not fit these data. 


DISCUSSION 


This experiment has demonstrated 
why there can be no answer to the 
question, “Is learning (especially PA 
learning) an all-or-nothing event?” In 
one list, with one set of Ss, we obtained 
all-or-nothing data on half the items and 
two-process data on the other half. 
Each process is all-or-nothing and the 
times of the several processes are in- 
dependent random variables. The sev- 
eral processes are identified as separate 
learning processes; in this we distinguish 
the present results from typical applica- 
tions of Estes’ “N-Pattern” theory 
(Estes, 1959). 

Kintsch (1963) has recently shown 
that a two-process model fits the learning 
of PA items in which the responses 
require integration. He also shows the 
connection between these results and 
Rock’s experimental method. Rock and 


? With the parameters estimated, we 
calculate that about 15 items would be left 
unsolved in the 30 trials, probably all in the 
intermediate discrimination State S; In 
Fig. 5B the total errors from Equation 8 are 
modified by adding in the distribution of 15 
items unlearned for 30 trials. This is ap- 
proximated by a binomial distribution of 30 
random responses with P =.5. Figure 5C 
is modified by noting that of these 15 items, a 
fraction Q have last error at Trial 30, QP have 
last error at Trial 29, etc. This geometric 
distribution is added to the distribution 
calculated from Equation 9. These correc- 
tions for termination of the experiment are 
only approximate, for they do not take 
account of the possible effects of trials in 
State So. 


Steinfeld (1963), using Rock’s method, 
observed  all-or-nothing performance 
when Ss pronounced nonsense-syllable 
responses. Other Ss who had to spell the 
responses, learned slower and showed 
multiple-stage learning, presumably be- 
cause the spelled responses required 
integration and the pronounced words 
did not. 

Above we have tested a simple additive 
model of the two processes of association 
learning and stimulus discrimination. 
In our model, each item is considered 
separately. A deeper analysis shows 
that twin stimuli are more intimately en- 
tangled than is said in that theory, and 
some of the consequences are seen in 
Fig. 4C and 4D. Figure 4C shows that 
the last nonconfusion (random) error 
was made on a later trial than predicted 
by the theory. What happens is that 
some items show initial guessing errors, 
then a string of confusion errors, then one 
or more guessing errors. The pre- 
solution curve, Fig. 4D, plots the propor- 
tion of correct responses on trials before 
the last random error. For this plot 
confusion errors are considered Correct 
responses and only random errors are 
scored as errors. The blocks of con- 
fusion responses before a late random 
error have served to elevate this curve. 
It seems that items get into the stimulus 
discrimination state (State S,) and then 
return to the original State So. 

What may actually happen is this. 
Suppose that S-R, and S'-R, are two 
items and S and S' are twinned. The 
S finds some strategy for remembering 
S-R, but then generalizes, and to S' he 


says Ri He knows no separate response 
to S'. Now he finds a discriminating 
st 


trategy and no longer confuses S with 
S'. He makes Ri; to S, correctly, but he 
has no response at all to S’. He now 
makes random responses to S' until he 
finds a strategy for remembering S'.R,. 
Discrimination may occur before asso- 
clation on some items. Item-by-item 
inspection of protocols gives some sup- 
port to this conjecture, in that if one 
item shows confusion followed by random 
errors, the twin item usually shows a 
string of correct responses throughout. 
This is consistent with the idea that one 
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name was learned for both stimuli, and 
then the stimuli were discriminated, 
before S ever learned the second name 
for the other stimulus. 

A correct theoretical analysis requires 
at least that the pair of twin items be 
treated together, in a fairly complicated 
Markov chain analysis. We have not 
reduced such an analysis to manageable 
calculations thus far. Furthermore, a 
whole list of items may be learned by 
interconnected systems of strategies. 
The Ss may use sentences or other com- 
pound strategies to learn several items 
at once, or may classify stimuli, or use 
other techniques to master a list. Such 
strategies can be discovered by studying 
interconnections between learning of the 
various items in a list. We attempt no 
such analysis in this paper. 

If the present theoretical position is 
correct, then (a) unitary learning is an 
all-or-nothing process, (b) many typical 
learning experiments require multiple 
processes to solution, and (c) these 
processes are each all-or-nothing, and 
with sufficient controls and proper ex- 
perimental design can be separated. 
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“i D FUNCTION OF THE ORDER OF 
EXTINCTION AS A FUNCTIO! F TI ORDER | 
PARTIAL AND CONSISTENT REINFORCEMENT} 


N. S. SUTHERLAND, N. J. MACKINTOSH, anp J. B. WOLFE 
Oxford University, England 


40 rats were trained in a straightway under partial or consistent ea 
forcement (PR or CR) or with one type of reinforcement schedu e 
followed by the other, and were then extinguished. There was no 
significant difference in resistance to extinction between Ss given ds 
training alone, and Ss given CR after PR; but giving CR before I R 
significantly reduced resistance to extinction. These results are in 
disagreement with both the discrimination hypothesis and a cognitive 


dissonance account of PR, but confirm a prediction made by a new 


explanation of the PR effect. 


Theios (1962) using rats in a 
straightway, found that resistance to 
extinction was almost as great when 
consistent reinforcement (CR) train- 
ing was given after partial reinforce- 
ment (PR) as when PR training alone 
was given. Jenkins (1962) confirmed 
this result with pigeons, and also found 
that if birds were given 10blocks of CR 
and 3 blocks of PR, resistance to ex- 
tinction was greater if PR was given 
before than if it was given after CR. 
Since, however, 3 blocks of PR may 
not have been sufficient to produce the 
basic PR effect, this result is difficult 
to interpret. The aim of the present 
experiment is to test the relative 
effects of CR followed by PR and of 
PR followed by CR with sufficient PR 
trials to produce a clear-cut PR effect. 

The results of such an experiment 
are of considerable theoretical interest. 
Sutherland (1964) has suggested that 
the PR effect may be understood in 
terms of a two-stage model of learning, 
in which it is assumed that animals 
must learn both which features of the 
stimulus situation to attend to (which 


1 This work forms part of a project on 
“stimulus analyzing mechanisms" ; the project 
is supported by the American Office of Naval 
Research (Contract N62558-2453) and the 
Nuffield Foundation. 


56 


analyzers to switch in), and which 
responses to make (Sutherland, 1959). 
The strength with which a given 
analyzer comes to be switched in 
depends upon the consistency with 
which its different outputs are differ- 
entially correlated with subsequent 
events of importance to the animal. 
Under CR training whichever ana- 
lyzers are initially switched in will 
yield outputs which are consistently 
correlated with reward, and the run- 
ning response will be conditioned only 
to the outputs from these analyzers. 
Under PR, however, no analyzer will 
yield an output consistently correlated 
with reward, and the animal should 
continually try out new analyzers: the 
running response will therefore be 
conditioned to the outputs from many 
analyzers. Provided enough trials are 
given for the strength of response at- 
tachments to be approaching asymp- 
tote, PR will produce more resistance 
to extinction than CR, since after PR. 
training the response must be ex- 
tinguished to the outputs from many 
more analyzers than after CR. 

This model predicts that when an 
animal receives both PR and CR 
training, trials to extinction will 
largely be determined by the earlier 


reinforcement schedule. If PR is 
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given first, the response will be condi- 
tioned to the outputs from many 
analyzers, and subsequent CR train- 
ing should not weaken these response 
connections. Giving CR training first 
will cause a small number of analyzers 
to be strongly switched in, and this 
will prevent the response being condi- 
tioned to the outputs from other ana- 
lyzers under subsequent PR training. 
Thus the model predicts that giving 
CR after PR will lead to greater 
resistance to extinction than giving 
CR before PR. The opposite pre- 
diction appears to be made by at least 
two other explanations of the PR 
effect, namely, cognitive dissonance 
theory (Lawrence & Festinger, 1962) 


and the discrimination hypothesis 
(Bitterman, Fedderson, & Tyler, 


1953); these theories predict that re- 
sistance to extinction will be at least 
as great when PR is given after CR as 
when it is given before. 


METHOD 
Subjects 


The Ss were 40 female hooded rats from the 
colony maintained at the Institute of 
Experimental Psychology, Oxford. They 
were experimentally naive, and aged 3 mo. at 
the start of the experiment. 


Apparatus 

The apparatus was a 6-ft. straight alley- 
way. The floor was made of chipboard and 
was 6 in. wide. The 6-in. high walls were 
made of hardboard. The first 7 in. was 
separated from the rest of the alley by a 
transparent Perspex guillotine door, and 
served as a start box. A similar guillotine 
door 3 in. before the end of the alley was used 
The goal box was 9 in. 
f similar material, and 
always contained a circular feeding bowl, 2 in. 
high and 4 in. in diameter. The apparatus 
was unpainted, and covered with 4-in. wire 


mesh. 


to prevent retracing. 
square, constructed o! 


Procedure 


eck before the start of 


Pretraining.—A W 1 
t on a feeding 


the experiment all 5s were pu 


schedule of 1 hr. food daily. Two days pre- 
training were given. On the first day, Ss were 
placed in the empty apparatus in groups of 
four, and allowed to explore for 30 min. On 
the second day, they were each given two 
rewarded runs. 

Training.—Twenty trials were given daily 
with a 6-min. intertrial interval (during which 
Ss were kept in individual detention cages). 
Running times were recorded with a stop- 
watch to the nearest 0.2 sec. from the raising 
of the start-box door to the time S's nose 
passed the guillotine door just before the goal 
box. On rewarded trials, S was allowed to 
eat in the goal box for 10 sec, On unrewarded 
trials S was detained in the goal box (with an 
empty feeding bowl) for 10 sec. 

Extinction.—Al groups started extinction 
on the day following their final acquisition 
trials. Twenty trials were given daily, with 
the same intertrial interval as during training. 
1f S failed to reach the goal box in 2 min., it 
was removed from the apparatus and kept in 
its detention cage until its next trial, All Ss 
were given 20 extinction trials on Day 1; 
thereafter they were stopped when they 
reached a criterion of three consecutive 2-min. 


trials. 


Experimental Design 


There were five groups each of 8 Ss. All 
groups were treated alike during extinction, 
but they received the following different 
treatments during training. Group P were 
given 60 PR trials, and no CR trials; Group 
P-C received 60 PR trials followed by 100 
CR trials; Group C-P received 100 CR trials 
followed by 60 PR trials; Groups C-60 and 
C-160 had no PR training and received, re- 
spectively, 60 and 160 CR trials. 


RESULTS 
Table 1 shows for each group the 
average number of trials to the cri- 


TABLE 1 


TRIALS TO EXTINCTION 


c | Meam | sp 
P 82 41 
p-C 77 36 
C-P a3 17 
C-60 17 6 
C-160 17 | 7 
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terion of extinction, namely, three 
successive failures to run in under 2 
min. The differences between groups 
were evaluated by the Mann-Whitney 
U test; the following comparisons are 
of interest. (a) The Partial group ex- 
tinguished much more slowly than the 
Consistent (U = 0, p < .001): hence, 
there was a clear-cut PR effect. 
(b) There was no significant difference 
in resistance to extinction between the 
two Consistent groups. (c) Compar- 
ing the Partial group with the two 
mixed groups, there was no significant 
difference between Groups P and P-C, 
but the difference between P and 
C-P was highly significant (U = 5, 
p < .001): thus giving CR before PR 
definitely reduces resistance to ex- 
tinction as compared with PR training 
alone. (d) Both mixed groups are 
more resistant to extinction than 
either of the Consistent groups: the 
effect is only just significant for Group 
C-P (U = 13, p < .05), and is signifi- 
cant at the .01 level for Group P-C 
(U = 9): thus giving PR before CR 
markedly increases resistance to ex- 
tinction as compared to giving CR 
training alone. (e) The most im- 
portant comparison for our purposes 
is that between the two mixed groups: 
Group P-C was significantly slower to 
extinguish than Group C-P (U = 7, 
p < .005): thus the order in which 
partial and consistent reinforcement 
are given does affect resistance to 
extinction. 

Running speeds are shown graph- 
ically in Fig. 1 for the last 10 trials of 
training and the first 60 trials of ex- 
tinction (animals which met the 
extinction criterion were counted as 
having a running time of 2 min. on 
subsequent trials). Group P-C ran 
slower than Group P over the first 30 
trials of extinction, but they con- 
tinued running for almost as many 
trials. The difference between mean 


Running Speed |/Sec 


IO Trial Extinction Blocks 


Fic. 1. Mean running speeds on last 10 


trials of training (T) and first 60 trials of 
extinction, 


running speeds of individual Ss in the 
two groups is significant at the .05 
level of confidence (U = 11) over 
Extinction Trials 11-30, 


Discussion 


The results confirm and extend those 
of Theios (1962) and Jenkins (1962). 
Both Es found (as we did) that giving 
CR training after PR led to only a small 
reduction in resistance to extinction. 
Our main result is that giving CR before 
PR produces a large reduction in resist- 
ance to extinction. This confirms Jen- 
kins' finding, and demonstrates that the 
reduction occurs in a situation where 
there is a difference between the effects 
of PR training alone and CR training 
alone. The latter difference was par- 
ticularly large in the present experiment, 
and this may be because only two trials 
pretraining with CR were given before 
training Proper began: most Es give 
Some pretraining with CR before PR 
(e.g., Theios gave 30 trials), and this 
should weaken the effect of PR and 
indeed may account for the failure of 
some investigators to obtain a clear PR 
effect. 


The predictions made by three theories 
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of PR have already been discussed in the 
Introduction. Since giving CR before 
PR weakens resistance to extinction 
more than giving it after PR, the pre- 
dictions made by the discrimination 
theory and by Lawrence and Festinger's 
theory are not confirmed. The pre- 
diction made by Sutherland’s proposal is 
confirmed. Moreover, if CR strengthens 
some analyzers, the shape of the extinc- 
tion curve for the p-C groups is ex- 
plained. If the dominant analyzers 
stay switched in longer after CR training, 
the initial effects of extinction will be to 
weaken the responses attached to them 
(for a direct demonstration of this effect 
cf. Mackintosh, 1963b) : hence, there will 
at first be a marked decrease in running 
speed. Only when the dominant ana- 
lyzers have been switched off in the 
course of extinction will the effects of the 
initial PR training which attached the 
response to less dominant analyzers have 
a chance to show. Thus over the first 20 
trials of extinction there is little differ- 
ence in the running speeds of the P-C and 
C-P groups. A difference emerges only 
when the dominant analyzers are ex- 
tinguished and at this point the C-P 
group extinguishes rapidly since the 
initial CR training has prevented the 
response from being attached to other 
analyzers, while the P-C group continues 
to run since the response is now con- 
trolled by the less dominant analyzers, 
and must be extinguished to them before 
running stops. 

The present experiment is of course 
only indirect confirmation of the theory. 
Other results (in preparation) show that 
with PR animals learn about more 


aspects of the stimulus situation than 


with CR. Moreover, the PR effect is 
only one aspect of learning to which 
the model applies. Mackintosh (1962, 
1963a, 1963b) has developed and tested 
a series of predictions made by the model 
concerning other aspects of the learning 
process. 
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EFFECTS OF DELAY INTERVALS AND MEANINGFULNESS 
ON VERBAL MEDIATING RESPONSES ! 
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Indiana University 


Delay intervals of 0, 2, or 8 sec. were interpolated between the 2 acquisi- 
= H mon 
tion stages of a mediation paradigm or between the second acquisition 


stage and the test stage. 


'The purpose of the studies was to 
assess the effects upon verbal media- 
tion of delay intervals interpolated 
between the two acquisition stages of 
a mediation paradigm or between the 
second acquisition stage and the test 
trial. The importance of temporal 
contiguity upon the establishment of 
mediating associations has been em- 
phasized since the days of Aristotle. 
Recently, using a simple reproduction 
task, Peterson, Peterson, and Miller 
(1961) demonstrated that forgetting 
may occur within seconds. If the 
factors producing forgetting operate 
during the acquisition of mediating as- 
sociations, mediated responses would 
be expected to decrease as the length 
of the delay intervals was increased. 

Two other variables known to in- 
fluence verbal mediation were in- 
vestigated simultaneously with the 
length of the delay intervals: the 
order of presentation of the items 
during the acquisition stages and 
the meaningfulness of the learning 
materials. 

In addition to the response measure 
of mediated facilitation, a second 
measure, response availability, was 


l'This research was supported by Grants 
M-5209 and 07944 from the United States 
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When the learning materials were of rela- 
tively low meaningfulness, mediated facilitation was found with del 
of 0 and 2 sec. but not with a delay interval of 8 sec. 
materials were of high meaningfulness, mediated facilit 
served with an 8-sec. delay interval as well. 
inversely related to the length of the del 


ays 
When the learning 
ation was ob- 
Response availability was 
ay intervals. 


also evaluated. Response availability 
was defined as the difference between 
empirical and a priori estimates of 
responding under a control condition. 


METHOD 


Experiment CCC;.—Consonant trigrams, 
CCCs, of from 0 to 29% association value 
(Underwood & Schulz, 1960) were used to 
construct mediation sets. The 20 consonants 
were used repeatedly among the 648 letters 
comprising the trigrams used: the median 
frequency of repetition was 19, Repetitions of 
the letters of the trigrams making up a single 
mediation set were minimized as much as 
possible. 

Table 1 presents the p 
construct mediation sets 
mediation sets 


aradigms used to 
(Column 2), sample 


(Column 3), and sample 
control sets (Column 4) for two of the 


experiments. The order of Presentation of 
the stages began with the top row and 
proceeded down the column, row by row. 
As illustrated in Table 1, Column 2, a 
mediation set began with the first stage 
Presentation of a pair of consonant trigrams, 
A-B. Stimulus A was then shown alone as a 
cue trial and SS was instructed to read off the 
stimulus and to give the response which had 
been paired with it on the previous presenta- 

The first delay interval, DI,, followed. 
In the second acquisition stage, the response 
term from the first pair, B, was shown as the 
stimulus with another trigram response term, 
C, which was then followed by the appropriate 
cue trial, B, and the second delay interval, 
Dl» In the test stage, the first stimulus 
trigram, A, was presented with the second 
Stage response, C, and two additional response 
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TABLE 1 
SAMPLES OF THE LEARNING MATERIALS FOR Two EXPERIMEN IS 
| Sample Sets 
Exp. | Paradigm 
Mediation Control 
Exp. CCCI A-B QJH NZF Qui xzr 
A | QJH QJH 
Dh (Name colors) (Name colors) 
B-C č MWB ZGW MWB 
B XZI ZGW 
DI (Name colors) (Name colors) 
A-CDE QJII MWB FHJ GNM QJ MWB FHJ GXM 
Exp. KRvri (B-C)* (heavy light)* (poor—?)* 
p (DID? (2) (bye 
B-A heavy GIL poor GIL 
B heavy poor 
DI, (Name colors) (Name colors) 
A-CDE GiL fast soap light Gi fast soap light 


alternatives, D and E. The stimulus A was 
printed above the row of three response 
alternatives. The Ss were instructed to 
verbally pair each of the response trigrams 
successively with the stimulus trigram, A, 
and then to select the response alternative 
(i.e., C, D, or E) which made the ‘best pair" 
with Stimulus A. A control set was prepared 
for each mediation set by substituting a new 
extraneous trigram for Stimulus B in the 
second-stage pair, X-C, providing a pool of 
54 mediation and their respective 54 control 
sets. Table 1 shows a sample mediation set 
and its control set in Columns 3 and 4. 

The C trigram appeared equally often in 
the first, second, or third position among the 
response alternatives on the test trials. To 
equate frequency of experience with each 
of the test alternatives, the D and E trigrams 
for a particular set were trigrams used as the 
C trigram in other sets. 

Cue trials were inserted after each ac- 
quisition stage to test the retention of the 
previous pair before proceeding with the rest 

If S erred on a cue trial, the 


of the set. 
preceding pair was reshown, then the cue 
trial was presented again, etc., until S 


responded correctly on the cue trial. 
The lengths of the first delay interval, Dl, 


(0, 2, or 8 sec.) were assigned factorially with 
the lengths of the second delay interval, Dla, 
(0, 2, or 8 sec.). Thus, the experimental 
design consisted of a 2X3xX 3 repeated 
measures factorial in which type of presenta- 
tion (mediation-control), the first delay 
interval (0, 2, 8 sec.), and the second delay 


» Assumed to have been established preexperimentally. 
b Delay interval greater than 8 sec. filled in unknown 


fashion. 


interval (0, 2, 8 sec.) were mutually orthog- 
onal, The resulting 18 cells were each 
represented three times among the 54 sets 
assigned to a particular S. The order of 
presentation of the sets so assigned was 
randomized. 

Experiment CVC ;.—Consonant-vowel con- 
sonant trigrams, CVCs, of 100% association 
value (Underwood & Schulz, 1960) were used 
as the learning materials in Exp. CVC; The 
median frequency of repetition of the 648 
letters used to construct the sets was 25; 
In all other respects, Exp. CVCr was an exact 
replica of Exp. EEG. 

Experiment CVCyur—The CVCs of Exp. 
CVC; were used as the learning materials for 
Exp. CVC viii but the order of presentation 
of the items during the acquisition stages 
differed. ‘The paradigm used in Exp. CCCs 
and CVC, (A-B, B-C, A-CDE) corresponded 
to Paradigm I of Peterson, Colavita, Sheahan, 
and Blattner (1964). They tested eight com- 
binations of the acquisition stage pairs and 
reported the greatest evidence of mediated 
facilitation with Paradigm VIII (B-C, B-A, 
A-CDE). Paradigm VIII was used in con- 
structing the mediation and their control sets 
for Exp. CVC vrr. (The subscripts for the 
experiments identify the paradigm des- 
ignation.) 

The A trigram from the first mediation 
set of Exp. CVC; was used as the A trigram 
for the first mediation set of Exp. CVCvrir. 
Similarly, the B, C, D, and E trigrams from a 
particular set of Exp. CVC, were transposed 
to their corresponding position in the Para- 
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digm VIII format. All other details were 
identical to Exp. CCC; and CVC. 
Experiment KRyi1—The CVCs of Exp. 
CVC; and CVCyri became the A components 
for Exp. KRvm, while the B components 
were stimulus words from the Minnesota 
revision of the Kent-Rosanoff norms (Russell 
& Jenkins, 1954) as shown in Table 1. The C 
components were the most frequently occur- 
ring responses (35-83%) to the chosen 
stimulus words, The median frequency of 
repetition of letters used in constructing the 
sets was 20. Paradigm VIII provided the 
at for the 36 sets. 
ub that the B-C stage had been 
established prior to experimentation, the 
paradigm investigated in Exp. KRvyrr was 
actually B-C, preexperimentally established, 
then B-A, and the test trial, A-CDE. Thirty- 
six mediation sets and their 36 control sets 
were prepared—sample sets appear in Table 1, 

Only the second delay interval could be 
experimentally manipulated and the same 
lengths were used: 0, 2, or 8 sec. The first 
delay interval was assumed to exceed 8 sec. 
in all cases. The experimental design of Exp. 
KRyvm was a 2X3 repeated measures 
factorial with type of presentation (media- 
tion-control) orthogonal to the second delay 
interval (0, 2, 8 sec.) The six cells were each 
represented six times among the 36 sets 
assigned to an S. The order of presentation 
of the sets was randomized. 

Apparatus—All learning materials were 
printed on 4 X 6 in. index cards. The delay 
intervals were filled with a color-naming task. 
Three samples of each of 10 different colors 
were randomized and mounted on a memory 
drum tape. The colors were shown at a 1-sec, 
rate. Two Hunter interval timers controlled 
the lengths of the delay intervals and the 
rotation of the memory drum, 

Experiment-subject Scheduling —To facili- 
tate experimentation, the four experiments 
were conducted separately. The order in 
which the experiments were run was ran- 
domized, yielding the sequence Exp. CVCyinr, 
CCCr, KRvrr, and CVC;. All four were 
completed within the first 2 mo. of the spring 
semester, 1963. The 132 introductory psy- 
chology students at Indiana University who 
signed up for participation were assigned 
randomly to the experiments subject to the 
restriction that 36 Ss be assigned to Exp. 
CCCr, to CVC, and to CVCyr and 24 Ss to 

Exp. KRyin. The Ss were notified when to 
report for participation. 

Procedure.—During the acquisition stage, 
Ss read aloud the letters of the syllables or 
read off the words. On the cue trials, they 


were instructed to respond with the missing 
member of the pair after reading aloud the 
first member of the pair. If S erred on the 
cue trial, the pair was presented again, then 
the cue trial was shown, etc., until S re- 
sponded correctly. The timers and the 
memory drum were activated when .S said 
the last (sixth) letter of the response on cue 
trials. During the test trials, .S successively 
paired each of the three response alternatives 
with the stimulus and then selected the 
response which made the “best pair” with the 
stimulus. No temporal restrictions obtained 
within the stages: however, Ss tended to 
respond rapidly. 

Three practice sets using different com- 
ponents in all positions were shown. The 
delay intervals were 2 and 8 sec. for DI, and 
DI» of the first practice set; 0 and 2 sec. for 


the second practice set; 8 and 0 sec. for the 
third. 


RESULTS 


Experiments CCCr, CVCr and 
CV Cvrr.—Experiments CCCy, CVG, 
and CVCyi were analyzed in one 
analysis of variance since Ss assigned 
to the experiments were essentially 
random samples from the Same popu- 
lation. Experiment KRvyri was ana- 
lyzed separately because only the 
second delay interval was varied. The 
mean number of C selections on the 
test trials for Exp. CCC, CVCi, and 
CVCyinr did not differ significantly 
nor were any interactions involving 
the three experiments reliable. 

Mediated facilitation was shown by 
the reliably greater mean number of 
test-trial selections of C following all 
mediation Presentations taken over 
the delay intervals (1.72) than follow- 
ing all control Presentations (1.66), 
F (1, 105) = 10.81, p < .01, The in- 
teraction of the type of presentation 
with the experiments was not sta- 
tistically significant. 

The variabl 
the length of ti 
polated, was a 


e of primary interest, 
he delay intervals inter- 
V ssociated with a reliable 
decrease in the number of selections 
of C on the test trials as the intervals 
were lengthened from 0 to 8 sec. 
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The mean numbers of Test-Trial C 
selections corresponding to delay in- 
tervals of 0, 2, and 8 sec. separating 
the two acquisition stages were 1.85, 
1.75, and 1.60, F (2, 210) = 11.64, 
p «.01. The means for the 0-, 2-, 
and 8-sec. delay intervals occurring 
between the second acquisition stage 
and the test stage were 1.79, 1.73, and 
1.67, F (2,210) = 3.76, p < .05. The 
means for the two delay intervals 
following mediation and following 
control presentation appear in Table 
2. Although the interaction with the 
type of presentation was not statist- 
ically significant for either the first, 
F (2,210) — 1.88, or the second, 
F (2,210) — 1.24, delay interval, in- 
spection of Table 2 suggested that the 
difference between selections of C on 
test trials following mediation and 
following control presentation (medi- 
ated facilitation) did tend to decrease 
as the length of the delay intervals 
was increased. A series of ¢ tests con- 
ducted between the mean number of 
Test-Trial C selections following me- 
diation presentation and following 
control presentation for each delay 
interval separately showed that medi- 
ated facilitation was obtained with 
0- and 2-sec. delays but not with an 


8-sec. delay for both positions of 
interpolation of the delay interval. 

The other response measure in- 
vestigated, response availability, was 
defined as the difference between the 
mean number of test-trial selections 
of C following control presentation 
with the a priori chance level of one 
selection of C per three test trials. 
It was assumed that an increase in the 
likelihood of selecting C on the test 
trials reflected some kind of sensitizing 
effect from the presentation of C 
during the acquisition stages. Re- 
sponse availability declined reliably 
over the delay intervals for both posi- 
tions of interpolation of delays. 

Since the enhancement of the test 
appearance of C was assumed to result 
from the recency of its acquisition- 
stage presentation, the paradigm with 
C in the first acquisition stage (VIII) 
should show a lower index of response 
availability than the paradigm with 
C presented during the second ac- 
quisition stage (D. The mean for 
Exp. CVCvri was 1.26 and the mean 
for Exp. CVCr (restricting the com- 
parison to identical learning materials) 
was 1.59, supporting earlier findings 
that response availability was related 
to the distance between the acquisi- 


TABLE 2 


MEAN NUMBER or SELECTIONS or C ALTERNATIVE ON TEST TRIALS 


Delay Interval 1 Delay Interval 2 
0 2 8 0 2 8 
Mediation Prey 1.95 1.80 1.63 1.86 1.83 1.69 
Control preser EY Evir 1.72 1.70 1.57 1.72 1.64 1.64 
Mediation presentation 2.36 2.42 1,90 
Control presentation 86 1.23 1.02 


KRvm 
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tion presentation of C and its appear- 
ance as a test-stage alternative (Peter- 
son et al., 1964). Following the same 
line of reasoning, it would be expected 
that the interpolation of delay inter- 
vals between the two acquisition 
stages would reduce the index of re- 
sponse availability to a greater extent 
for Exp. CVCvri with the C com- 
ponent presented during | the first 
acquisition stage than for Exp. CVC; 
with the C component presented 
during the second acquisition stage, 
'This contention was also supported : 
the decrease in the response availa- 
bility index from a 0-sec. delay to an 
8-sec. delay for DI; with Exp. CVCyyy 
was .26 and the comparable decrease 
for Exp. CVC; was .06. The decrease 
in the response availability index for 
DI» was greater for Exp. CVCy, .14, 
than for Exp. CVCyiz, .00. These 
results indicated that the reliable 
decline in response availability asso- 
ciated with DI; was produced pri- 
marily by Exp. CVCyir and the 
decline associated with DI; was pro- 
duced primarily by Exp. CVC; and 
CCC, (.12). 

Experiment | .KRyrri.—The mean 
number of test-trial selections of 
C following mediation presentation 
averaged over the delay intervals was 
2.23 and following control presenta- 
tions, was 1.04, F (1, 23) = 49.11, 
This difference was reliable beyond 
the .01 level, again demonstrating 
mediated facilitation. The mean 
numbers of test-trial selections of C 
following second delay interval values 
of 0, 2, and 8 sec. were 1.61, 1.82, and 
1.46, F (2,46) = 7.42. These differ- 
ences were reliable, 

As suggested by the means in Table 
2, the interaction between Type of 
Presentation and the Delay Interval 
was statistically significant, F (2, 46) 
= 5.26, p < .01. Further analyses 
showed reliable evidence of medi- 


ated facilitation for all three de- 
lay intervals, £a. control at 0 see. (46) 
= 11.03; fsrca- control at 2 sce. (46) = 8.71 H 
fMed.-Control at 8 see. (46) = 6.43. When 
the mean numbers of selections of the 
C alternative on test trials follow- 
ing mediation presentation associated 
with the various delay intervals were 
analyzed, the difference between 2 
and 8 sec. was reliable, ¢ (46) — 3.82, 
P « .01 but the difference between 0 
and 2 sec. was not, | (46) — .44, 
P 7.05. The reverse was true for the 
mean number of selections of the C 
alternative following control presenta- 
tion: only the difference between 0 
and 2 sec. was reliable, ¢ (46) — 2.76, 
P < .01. As with Exp. CVCyin, the 
second delay interval appeared to 
influence mediated facilitation to a 
greater extent than it influenced re- 
sponse availability. 

With Exp. KRyri the C component 
was not actually experienced during 
the acquisition Stage, rather it was 
assumed to have occurred during the 
preexperimental history of S. ‘It is 
possible that response availability 
increased with longer delay intervals 
because Ss had more time to think of 
associations to the B components 
(Kent-Rosanoff stimulus words) that 
Were presented during the acquisition 
stage. 

Discussion 


As the length of the de 
increased, the estimates 
facilitation decreased, parti 
learning materials of relativ 
ingfulness were used. When the length 
of either delay interval was 8 sec., no 
evidence of mediated facilitation - 
found for the experiments using CCCs 
and CVCs. When common English 
words were incorporated in the mediation 
paradigm, Exp. KRyqry, reliable evidence 
of facilitation was found with a delay 
interval of 8 sec, although it was lower 
than the estimates associated with 0- and 
2-sec. delay intervals, Previously, Peter- 


lay intervals 
of mediated 
cularly when 
ely low mean- 


was 
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son and Blattner (1963) reported that 
mediated facilitation increased as the 
meaningfulness of the learning materials 
increased. However, their observations 
would not explain the greater suscepti- 
bility to delay intervals of the less mean- 
ingful materials. Some insight may be 
provided by a study which showed that 
retention increased with increases in the 
meaningfulness of the stimulus materials 
(Peterson et al., 1961). In the present 
situation, it was expected that the more 
meaningful materials would be more 
likely to be retained until the test trials 
|; were less meaningful materials. 
The prediction assumed that some 
representational form of the acquisition 
associations had to be retained until the 
test trials if mediated facilitation were 
to be demonstrated. 

No test for retention was introduced 
at the end of cach delay interval because 
the temporal separation of the stages was 
one of the variables under investigation 
so no direct evidence of retention follow- 
ing the delay interval was obtained. 
Each association was tested for retention 
at the beginning of each delay interval 
to be sure the association could be re- 
peated by S. The mean number of errors 
made on the retention or cue trials of 
Exp. CCC; was 13.75; of Exp. CVCr and 
CVCyrir were .83 and .83; and of Exp. 
KRyirr was .00. One interpretation 
was that the lower the meaningfulness, 
the more errors made on cue trials, and 
the more susceptible the estimates of 
mediated facilitation were to interference 
from interpolated delay intervals. As 
Underwood and Schulz (1960, p. 4) have 
suggested, the variable of meaningfulness 
appears to wield a great influence over 
verbal measures, including mediation. 
Another interpretation of the results was 
the error analysis. The 
acquisition pairs of Exp. CCCr in 
particular, may not have been learned 
as well as the acquisition pairs of Exp. 
KRynr and these differences might 
have been reflected in the estimates of 
mediated facilitation obtained on the 
test trials. However, two features of the 
experimental procedure should be noted. 
Whenever errors were made on the cue 


than 


suggested by 


trials, the pair and the cue trial were 
repeated until S responded correctly on 
the cue trial so that the pairs of Exp. 
CCC; were actually presented more 
often than the pairs of the other experi- 
ments. Because Ss were required to read 
off the stimulus material whenever it 
was presented, at least a certain amount 
of familiarity, if not learning, should have 
accrued to the pairs of Exp. CCCy. In 
addition, when the cue-trial responses 
were correct, the pairs were not shown 
again, thus denying the experimental 
opportunity for overlearning which might 
create differences in the degree of learn- 
ing between experiments with over- 
learned and with barely learned pairs. 

Turning to the paradigms used during 
the acquisition stages, Exp. CVCviri was 
expected to show greater evidence of 
mediated facilitation than was Exp. 
CVCr because Paradigm VIII yielded 
more mediated facilitation than any 
other paradigm in an earlier study 
(Peterson et al. 1964). No reliable 
differences were observed when all sets 
were considered in the comparison or 
when the comparisons were restricted to 
an analysis of the means of sets with no 
errors on the cue trials, although the 
predicted tread was found in the latter 
analysis. 

The interpolation of delay intervals 
influenced response availability also. 
The greatest effect occurred when the 
delay interval was introduced immedi- 
ately after presentation of the C com- 
ponent. Thus for Exp. CVCvyrr with C 
presented as the first-stage response 
term, the decrease in response availa- 
bility was more marked than the decrease 
observed when the delay interval fol- 
lowed the second acquisition stage and 
the reverse was true for the two experi- 
ments with B-C learned as a second-stage 
association (CCCr and CVCr). The 
effect of interpolation of a delay interval 
immediately after the acquisition pres- 
entation of B-C was not investigated 
with Exp. KRyirr because B-C was 
assumed to have been established prior 
to actual experimentation but more than 


8 sec, must have intervened. 
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The present stu 
82 rats were giv 


in extinction. 

straight alleys. 
sucrose solution reward in 
Following 42 
was extinguished in 
negative alley. G- 
positive runway (RW)an 
RW was negative and the GB p 
speeds were recorded. G— + 


while extinction was generally fastest for 
These results were in 


were intermediate. 


conventional concept of secondary 


formulation was suggested. 


In 1953 Bitterman, Feddersen, and 
Tyler reported that (a) resistance to 
extinction following partial reinforce- 
ment of a running-jumping response 
is lessened if the goal box (GB) is 
altered at the beginning of extinction ; 
(b) following discrimination training, 
the previously negative (unrewarded) 
GB produces greater resistance to 
extinction than the positive GB; and 
(c), that Ss which receive discrimina- 
tion training and are then extinguished 
with the previously positive GB show 
less resistance to extinction than Ss 
which had received ordinary partial 
reinforcement during acquisition. 

While Finding a can readily be 
interpreted as a secondary reinforce- 
ment effect, the second result seems 
quite embarrassing to that concept, in 
that Ss receiving more secondary rein- 
forcement (i.e., exposure to the posi- 
tive GB) during extinction, extin- 
guished faster than Ss receiving less 
such reinforcement. Elam, Tyler, and 


1 The writer expresses his appreciation to 
Gordon Kimbrell and Robert Fowler for 
their aid in the collection of these data. 

Now at the State University of Iowa. 
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was concerned with secondary reinforcement effects 
en differential conditioning in a pair of 
For each S one alley (black or white) was positive (a 
goal box) while the other was negative. 
acquisition trials, 5s were divided into 4 groups. G++ 
the previously positive alley 
+ was extinguished in an alley consisting of the 
d negative goal box (GB), while for G— + the 
ositive. 
showed the slowest rate of extinction, 


and G— — in the 


Starting, running, and goal-box 


G--—. G++ and G— — 
terpreted as contrary to the 
reinforcement and an alternative 


Bitterman (1954), however, reported 
a subsequent replication of this result. 

One feature of the Bitterman and 
Elam studies which complicates inter- 
pretation is the fact that they meas- 
ured response strength at a point 
in the alley at which no differen- 
tial, reward-related cue was present. 
Thus, although GBs of different 
brightnesses were used on positive and 
negative trials, the same gray alley 
was employed on all trials. While 
this procedure may make the second- 
ary reinforcement properties of the 
GBs obvious, the reinforcement prop- 
erties of the alley—where response 
time was actually recorded—are 
ambiguous. 

The present study makes possible a 
more definitive analysis of the role of 
secondary reinforcement in extinction. 
Four groups of Ss were given differen- 
tial conditioning training in a pair of 
straight alleys, a runway (RW) and 
GB of one brightness being positive 
and the other negative. The groups 
were then extinguished under the 
following conditions: for Group ++ 
the positive RW and GB were em- 
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oyed ; for Group — —, the RW and 
D were negative. For Group +— 
e RW was positive, while the GB 
as negative and for Group — + the 
:gative. RW and positive GB were 
sed. This design permitted the com- 
iison of groups receiving different 
B cues (or degrees of secondary 
inforcement) but the same RW cues, 
| well as groups receiving different 
W cues, but the same GB cues. In 
ct, two such comparisons under 
ich condition were possible. Thus, 
1e comparisons of Group 4-— vs. 
roup — — and of Group ++ vs. 
roup — + should each reflect second- 
ry reinforcement effects in the RW 
ith GB cues constant; while the 
omparison of Group ++ vs. Group 
-— and of Group —+ vs. Group 
-— should reflect secondary rein- 
cement in the GB with constant 
W cues. According to the concept 
| secondary reinforcement, of course, 
ie first member of each comparison 
ould show better performance in 
‘tinction than the Second, since in 
ich case the first member receives 
ie greater amount of secondary rein- 
rcement—i.e., exposure to a stim- 
us previously associated with pri- 
ary reward, 


METHOD 


Subjects.—The Ss were 90 male albino rats 
rchased from the Budd Mountain Rodent 
rm, Chester, New Jersey and were about 
days old at the beginning of deprivation. 
Apparatus.—The apparatus consisted ofa 
r of enclosed straight alleys 3 in. wide and 
n. high with a start box (SB) 6 in. long, 
4-in. runway (RW), and a 9-in. goal box 
B). 

 -— start door separated the SB and 
^, while between the RW and GB was a 
d door hinged at the top of the alley 
ch S opened to enter the GB and which 
vented S from seeing the GB from the 
. The interior of one RW and GB was 
ited a flat white, while the other RW and 
were black. Both SBs were gray. The 
| were attached to the RWs by hooks and 


thus could be interchanged. A metal-lined 
indentation at the center of a wood block 
served as the food cup in each GB. 

By means of floor treadle switches and 
Standard Electric timers, starting, running, 
and GB times were recorded. Start time was 
measured from opening of the start door to 
6 in. within the runway ; running time over the 
next 12 in.; and GB time over the next 12 in., 
and, thus, to a point 6 in. within the GB. 


Procedure 


Drive maintenance and pretraining.—One 
week prior to the be sinning of pretraining, 
all 5s were placed on a 20-hr. food deprivation 
schedule with water continuously available. 
Four days prior to the beginning of acquisi- 
tion, a pretraining Schedule was initiated in 
Which S was placed in a walnut-stained box 
12 xX 8 X Gin. high, containing a glass caster 
into which had been placed 1 cc of 32¢ by 
weight sucrose solution. The S remained in 
the box for 5 min. or until it consumed the 
sucrose solution. Five such feedings were 
given each day for 4 days. Eight Ss consist- 
ently failed to consume the sucrose and were 
discarded. 

Acquisition.—Before acquisition the ani- 
mals were divided into two groups of 41 Ss 
each. For one group, the white RW and GB 
were designated as “positive” while for the 
other group, the black RW and GB were 
"positive." On each day of acquisition .S was 
placed into one of the SBs, the start door was 
opened and S remained in the alley until it 
entered the GB. A 1.cc 32% sucrose solution 
served as reward on positive trials, no solution 
being present on negative trials, 

On the first three trials 
acquisition the door between the RW and 

B was open and 
trials it was half open. 
each day with an avera 
of about 15 min. 


als (positive vs. negative) 
ch day. 


? day following 
the beginning of acquisition, extincti 4 


was used for all 


Group ++ (n = 21) 


was tested during extinction in an alley con- 
sisting of the RW and GB which had been 
Positive for those Ss during acquisition; for 
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Group +— (1 = 20) the positive RW but 
negative GB was employed; for Group — + 
(n = 20) the negative RW and positive GB; 
and for Group —— (x = 21) both the RW 
and GB were negative. In both Group +— 
and Group — +, white was positive for 10 
Ss and black was positive for 10. In both 
Group ++ and Group ——, there were 10 
white- and 11 black-positive Ss. 

Six trials per day for 7 consecutive days 
were administered during extinction. If .S 
failed to enter the GB within 3 min. after an 
extinction trial began, it was removed from 
the alley and latency for that trial was 
recorded as 3 min. 


RESULTS 


Before analysis of the data was 
undertaken, all starting, running, and 
goal-box times were converted into 
speed scores (ft/sec). 

Acquisition data.—Figure 1 indi- 
cates daily mean starting speeds to 
the positive and negative alleys during 
acquisition. (Times were not re- 
corded on the first day of acquisition.) 
Inspection indicates that Ss began to 
show clear diflerentiation between the 
alleys by the fifth day of training and 
by the final day of training mean 


hoa o 


STARTING SPEED, (FT/SEC) 
o 


Fic. 1. Mean starting 


60 


starting in the positive alley was 
about twice that in the negative. 
The acquisition data for running and 
GB speeds were essentially identical 
to that for starting speed and are not 
presented. 

Extinction data.— Because the speeds 
for the various groups converged with 
continuing extinction trials, only the 
data for the first 18 trials (3 days) 
were analyzed. When separate anal- 
yses of variance were performed on 
the three sets of speed scores, the 
relationships among the groups were 
identical for starting and running 
speeds, although somewhat different 
on GB speed. "Therefore, only start- 
ing and GB speeds are discussed 
below. 

Goal-box speeds—Figure 2 repre- 
sents level of performance during 
extinction in terms of GB speeds. 
Inspection indicates considerable dif- 
ferences in the performance of the four 
groups, during the early extinction. 
Thus Group 4-— and Group —— 
show less response strength than 
Group — + and Group ++. Group 


DAYS 


speed in the positive and negative alleys 
as a function of acquisition day. 
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GOAL BOX SPEED 
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TRIAL BLOCK 


Mean GB speed during extinction as a function of blocks of two trials. 


Fic. 2. 


T —, in fact, appeared to reach vir- 
tually complete extinction in only 2 
trials, while Group —-- improved 
during the first 4 trials. Meanwhile 
Group ++ was superior to Group 
— —, although their curves appeared 
to completely converge in about 18 
trials. An analysis of variance in- 
dicated differences between groups on 
Trials 1-18 to be highly reliable, 
F (3,78) = 19.07,» < -001, while the 
interaction of groups by trials during 
this period failed to prove significant, 
F (51, 1326) < 1.00. — . 
While the preceding indicates level 
of performance during extinction, it 
should be noted that the four groups 


entered extinction at different levels 
of performance and hence a compari- 
son of the rate at which the groups 
extinguished from their respective 
initial levels is of interest, Following 
a procedure described by Anderson 
(1963, p. 164) estimates were obtained 
for each S's initial GB speed, R(1), 
and asymptotic extinction speed, R (0). 
R (1) was defined as an S's GB speed 
on Extinction Trial 1, while R(0) was 
S's mean speed on the last day of 
extinction. Using the expression : 


RO) — (1/N')ZR (n) 
R(0) — R(1) 
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where R(n) is S's GB speed on any 
trial from 1 to 18 and N = 18,a rate of 
extinction score was computed for each 
S. Means of these scores for the 
groups were: Group +—, .049; Group 
— —, 408; Group ++, .479; Group 
— +, .997. Thus, in agreement with 
the earlier analysis, the average GB 
speed of Group +— for Trials 1-18 
was only slightly above its final ex- 
tinction level. Group —+, on the 
other hand, showed an average GB 
speed during these trials almost equal 
to its first trial speed of .84 {t/sec. 
An analysis indicated differences in 
extinction rate to be highly significant, 
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Fic. 3. Mean starting speed dui 


ring extinction as a function of block 
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F (3,78) = 13.32, p «.001. Sub- 
sequent ¢ tests indicated that Group 
— + extinguished more slowly than 
any other group (all ?'s < .01) while 
Group + — extinguished more rapidly 
than any other group (all p’s < .01). 
The difference in rate of extinction 
between Group ++ and Group — —, 
however, failed to prove reliable, 
t (40) = 0.64. 

Starting speeds.—Inspection of Fig. 
3 indicates substantial group differ- 
ences in performance during extinction 
and an analysis indicated these differ- 
ences were highly significant, F (3, 78) 
= 7.39, p « .001, while the Groups 


M 13 


TRIAL BLOCK 


s of two trials. 
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X Trials interaction failed to prove 
reliable, F (51, 1326) = 1.17. When 
rate of extinction scores were com- 
puted, the group means were: Group 
+—, .274; Group ++, -437 ; Group 
——, .462; Group — +, .964. Al- 
though these relationships were quite 
similar to those obtained for GB 
speeds, the overall differences in rate 
of extinction failed to prove reliable, 
F (3, 78) = 2.19, .10 > p > .05. 
This failure to statistically confirm 
differences in rate of extinction among 
the groups appeared at least in 
part to result from the fact that the 
groups approached different asymp- 
totes. Nevertheless, it may be co- 
gently argued that, for example, 
Group ++ was more “resistant to 
extinction” than Group +-—, since 
these groups entered extinction at 
same level of performance: for first 
trial, ¢ (39) = 0.42, while Group ++ 
subsequently maintained its perform- 


ance at a higher level, 7 (1, 39) 
= 5.20, p < .01. Similarly, Group 
—+ and Group — — entered per- 


formance at the same level, t (39) 
= 0.77, while Group — + maintained 
its performance at a higher level, 
F (1, 39) = 9.73, p < .001. Finally, 
since Group — + entered extinction 
at a lower level than Group ++, but 
subsequently performed at a higher 
level, F (1,39) = 417, p < .05, it 
may be said to have shown greater 
resistance to extinction than Group 
E 


DISCUSSION 


In general, the results for both starting 
and GB speeds indicate that Group =F 
showed the greatest resistance to extinc- 
tion and Group +— the least, while 
Group ++ and Group —— were inter- 
mediate. It is clear that certain aspects 
of these findings can be readily derived 
from the conventional concept of second- 
ary reinforcement. Thus, when RW cues 
were equated (Group ++ vs. Group 


+-— ; and Group — + vs. Group — —), 
the groups receiving secondary reinforce- 
ment in the GB showed greater resistance 
to extinction. On the other hand, when 
GB cues were equated (Group ++ vs. 
Group — +; and Group +— vs. Group 
— —), the secondary reinforcement effect 
failed to appear. In fact, those groups 
receiving secondary reinforcement in the 
RW clearly tended to show less resistance 
to extinction than the groups receiving a 
negative RW cue. Furthermore, these 
results cannot be interpreted as indicat- 
ing simply that secondary reinforcement 
is effective only when presented in the 
GB, since the performance of groups 
receiving identical GB cues varied signifi- 
cantly as a function of RW cues. Rather, 
it was the relationship between RW and 
GB cues which constituted the 
important determinant of 
performance. 

An alternative interpretation sug- 
gested by these data involves primarily 
the concept of incentive motivation. 
Spence (1956) has assumed that certain 
components (r,'s) of the consummatory 
R(R,) become conditioned to the GB on 
rewarded trials and, along with their 
afferent consequences (so's) generalize 
back into the RW and SB. The evoca- 
tion of r,-s, is assumed to contribute to 
level of motivation in proportion to the 
vigor with which it is evoked. After 
training in a differential conditioning 
situation such as involved in the present 
study, 7,-s; would presumably be vigor- 
ously evoked by the positive RW and 
GB, but only weakly evoked by the 
negative RW and GB. Further, Amsel 
(1958) and Spence (1960) have assumed 
that, once 77-5, has developed, nonrein- 
forcement of the instrumental R results 
in the evocation of an emotional R, 
frustration, components of which (ry-sy) 
will also generalize to the RW and SB. 
During extinction following consistent 
reinforcement, frustration is assumed to 
elicit responses which compete with the 
instrumental R and hence, depress per- 
formance. The vigor with which such 
7;-5; is evoked is assumed to be a positive 
function of the strength of fo- Ss. 

The hypothesis Proposed here is 


most 
extinction 


an 
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extension of this general line of reasoning. 
It is assumed that frustration occurs 
when an instrumental R is followed by a 
decrement in r,-5; and that the magnitude 
of the frustration response will depend 
on the rate and magnitude of reduction 
in rs, It is further assumed that so- 
called “secondary reinforcement” occurs 
when an R is followed by an increment in 
rss “Primary reinforcement” in these 
terms would, of course, differ from 
“secondary reinforcement” only in the 
fact that the increment is in overt con- 
summatory behavior (R,), rather than 
To-So- 

Application of these assumptions to 
the present results is facilitated by 
reference to Table 1, indicating the 
relative resistance to extinction in the 
comparisons of primary interest. Thus, 
the upper left-hand cell indicates that 
(a) Group ++ showed greater resistance 
to extinction than Group + —, (b) the 
RW cue was the same for these groups, 
and (c) that this result is presumably due 
to the greater frustration evoked follow- 
ing the instrumental R for Group p= 
than for Group ++. This last point 
follows from the present hypothesis in 
that r,-5, abruptly decreases for Group 
--— on opening the GB door, while for 
Group ++ the reduction occurs only 
after entering the GB, discovering the 
empty goal dish, etc. Furthermore, the 
magnitude of the decrement is presum- 
ably greater upon entering the negative 
GB than on entering the unbaited posi- 
tive GB. The cells in the lower row of 
Table 1 indicate that Group — + proved 
more resistant to extinction than either 


TABLE 1 


EXPECTED RELATIONSHIP AMONG Gnours 


Equated Cue 
Source of Effect Mg, Ls 


RW GB 
i Eam] 
i --»G4- 
Frustration G++>G+ >Gt 
Reinforcement Gu 4+|G++<G + 


Note.—Cells indicate the group expected to fu 
greater resistance to extinction. Columns indicate à 
cue on which incentive is equated, while rows inc ioa e 
whether the expected difference reflects a frustration 


or a reinforcement effect. 


Group — — or Group ++- This finding 
is interpreted as a "secondary reinforce- 
ment" effect in the sense in which that 
concept is here employed—i.e., for 
Group — + there is a greater increment 
in r,-5, upon entering the GB than for 
either Group ++ or Group ——- 

It should be recalled that each ex- 
pectation reflected in Table 1 was con- 
firmed statistically for both starting and 
GB speed with the exception that the 
difference in rate of extinction for Group 
— — and Group +—, while in the ex- 
pected direction for both starting and 
GB speeds, was significant only for the 
latter. It may also be noted that Group 
+-+ and Group — — failed to differ in 
rate of extinction for both starting and 
GB measures. This finding is com- 
patible with the present hypothesis, 
since, while 72-55 presumably increases 
somewhat more for Group ++ than 
Group — — on entering the GB, the sub- 
sequent frustration for that group is also 
greater. These simultaneous and op- 
posed processes might be expected to 
result in Group ++'s showing a rate of 
extinction comparable to that of Group 

Empirical support for the assumptions 
made above is not lacking elsewhere in 
the available literature. Thus, the find- 
ings that frustration appears increase 
following acquisition with large rewards, 
(Hulse, 1958; Wagner, 1961), following 
a large number of acquisition trials 
(North & Stimmel, 1960), and when non- 
reinforcement involves blocking com- 
pletion of the instrumental R rather 
than simple omission of reward (Hulse 
& Stanley, 1956; Williams & Williams, 
1943), are in each case implied by the 
hypothesis. Similarly, the facts that 
acquisition is facilitated by large reward 
and depressed by delay of reward follow 
from the assumption that magnitude of 
reinforcement (whether "primary" or 
secondary") depends upon the rate and 
magnitude of increment in consumma- 
tory behavior following the instrumental 
R. The most interesting possibility, 
however, in the writer's opinion, is that 
the phenomena conventionally and circu- 
larly "explained" as "secondary rein- 
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forcement effects" may prove capable of 
systematic reinterpretation in terms of 
implicit consummatory behavior. Thus, 
the presentation of a stimulus previously 
paired with primary reinforcement (i.e., 
overt consummatory behavior) would be 
expected to strengthen an R—or fore- 
stall its extinction—because of its condi- 
tioned capacity to evoke implicit con- 
summatory behavior, thus both lessening 
frustration and “secondarily reinforcing" 
the response in question. 
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An experiment was C 
payoff functions on t 
undergraduates, were require 
occur on each of 400 trials. 

to an all-or-none payoff function, w 
ing to a linear function. 
response 
Moreover, it was found th 
pected-utility hypothesis as op 
pothesis 
individual differences were found in 
behavior, V, and in terms of the entro 
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at Ss' beh 


Interest in decision behavior in the 
two-choice situation was generated 
through the application of statistical 
learning theory to such tasks and 
through the discovery of the reliable 
probability-matching phenomenon. 
The behavioral law of probability 
matching is well established in the 
two-choice situation (Estes, 1957). 
In the three-choice situation with no 
monetary reward Komorita (1958) 
found the effect, but Gardner (1957) 
and Cotton and Rechtschaffen (1958) 
found reliable overguessing of the 
most frequent alternative. In addi- 
tion, Gardner (1958) showed that the 
effect did not occur when the number 
of choices was increased. The relative 
frequency of response with the most 
frequent alternative tended to exceed 
the probability of that alternative, the 
difference increasing as the number of 
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It was fou 
for the 10 alternatives differed ma: 


and a generalization of it. 
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One half of the Ss were paid according 
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ind that the relative frequency of 
rkedly for the 2 groups. 


avior was compatible with the ex- 


posed to the probability-matching hy- 


In addition, large and stable 
terms of the expected value of Ss' 
py, H, of Ss' response distributions. 


alternatives increased. Siegel and 
Goldstein (1959), Brackbill, Kappy, 
and Starr (1962), and Siegel and 
Andrews (1962) found that increasing 
the reward for making a correct choice 
and decreasing the reward for an in- 
correct one resulted in relative fre- 
quencies of responses with the most 
frequent alternative which exceeded 
the probability of that stimulus. 
Edwards (1956) moreover demon- 
strated that with an asymmetric 
payoff, the less likely of two events 
will be predicted more often than the 
more likely, if the payoff for a correct 
choice of the former is sufficiently 
larger than that for a correct choice of 
the latter. 

Although statistical learning theory 
has been extended to account for the 
effects of magnitude of reward (Atkin- 
son, 1962), some problems remain in 
the application of the theory to more 
complex decision contexts (e.g., Estes, 
1962; Myers & Atkinson, 1963). An 
alternative theoretical approach exists 
which is general enough to cover the 
more complex situations. This is the 
decision-theoretic approach which pos- 
tulates that human Ss in multiple- 
choice situations execute strategies 
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which tend to increase or to maximize 
Ss’ expected utility. The weakest 
statement of the above hypothesis 
(which will be called the Expected 
Utility—-EU—hypothesis) is that the 
relative frequency of response with a 
particular alternative will be a non- 
decreasing function of the expected 
utility of the response. The expected 
utility maximization hypothesis, the 
relative expected loss minimization 
(RELM) principal (Edwards, 1956), 
and the models developed by Siegel 
(1959) are examples of EU models. . 
All of the experiments mentioned in 
the previous paragraphs are in accord 
with the general EU hypothesis. 
However, with the exception of the 
experiment by Edwards (1956), these 
experiments are also in agreement 
with a generalization of the proba- 
bility-matching hypothesis which as- 
serts that the relative frequency of 
response with a particular alternative 
is a nondecreasing function of the 
probability that the response will be 
correct (e.g., Atkinson, 1962). This 
is a result of the fact that these studies 
have been restricted to one specific 
type of payoff function, namely, an 
all-or-none payoff function in which a 
response on a given trial is either 
correct or incorrect. The present 
study is an attempt (a) to replicate 
the finding that the specific proba- 
bility-matching phenomenon does not 
occur when the number of alternatives 
is large and when monetary payofís 
are given for correct choices, and (b) 
to show that the EU hypothesis can 
be generalized to a different type of 
payoff function (i.e., a linear payofi 
function in which the payoff for a 
particular response is a linear function 
of the difference between that re- 
sponse and the response which would 
have been correct) while the general- 
ized probability-matching hypothesis 
cannot. 


METHOD 


The Ss were 36 undergraduate students en- 
rolled in a summer course in introductory 
psychology. The Ss were given course credit 
for their participation in the experiment. T he 
Ss reported individually to the dimly illumi- 
nated experimental room. They were divided 
into two groups as they appeared for the 
experiment. All odd-numbered Ss, S-1, S-3, 
etc., were in Group 1 while even-numbered Ss 
were in Group 2. The Ss were seated 10 ft. 
from a 42 X 42 in. projection screen and read 
the instructions for the experiment. 

The instructions which S read told him 
that his task in the experiment required him 
on cach of 400 trials to guess how many 1's 
would appear in each of a set of 3X3 
matrices composed of O's and 1's which 
would be projected onto the scrcen in front of 
him. Only the instructions concerning the 
payoff differed for the two groups. 
tion to the general instruction 
were given the following 
Structions. 


In addi- 
5, 5s in Group 1 
"all-or-none" in- 


Your score for the experiment will be 
determined by the number of times you 
guess the slide correctly. Each time this 
occurs, you will receive a nickel. Try to 
win as many nickels as possible. All the 
money you win is yours. 


The Ss in Group 2 were given the following 
"linear payoff” instructions. 


Your score for the experiment will be 
determined by the difference between your 
guess and the actual number of 1's in the 
slide. You will receive Payments for your 
guesses, depending upon your score. For 
example, if you guess “7” and the actual 
number is "4" the difference then is three. 
This difference will be subtracted from ten, 
and the remainder will be the number of 


Pennies you win. In this case, you would 
receive 7 cents, 


In order to make the average expected 
winnings of the two groups approximately 
equal, however, Ss 


in Group 2 were told that 
they would be allo: 


} t Wed to keep one tenth of 
their total winnings. 


Formally, if X is S's response and Y is the 
actual observed number of I's in the 3 x3 


matrix, the all-or-none Payoff function is 
represented by 


pins Scents, if X = js 


0 cents, if X x y. uJ 
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The linear payoff is given by 
1 
Py = — = P 
57 10 10 — (|X 2 
1 
a= =k = Yi: 
TS p. [2] 


The strategy which maximizes the ex- 
pected payoff in the all-or-none case is that 
of guessing the alternative with the highest 
probability of occurrence, in this case 0. The 
best strategy with the linear payoff is, in 
general, to guess the median value of the 
probability distribution. That value in this 
experiment is 3. 

After having answered any questions, E 
suggested that S help him mix the slides to 
be used in the experiment and to place them 
jn the carriage for the Kodak Cavalcade pro- 
jector (Model 510). In all, 80 slides were 
used. The second and third columns of 
Table 1 give the number and percentage of 
slides, respectively, having n 1's in the 3 X 3 
matrix, » = 0, 1: .9. The reason for using 
a stimulus distribution of this form is that 
such an inverse J-shaped distribution permits 
good separation of the modal and median 
values of the distribution. 

The S was told to give his first prediction. 
Then the first slide was shown. This was 
repeated for 80 trials, with S giving his 
prediction on each trial and observing im- 
mediately the actual number of 1's in the 
matrix. In Group 1, each time S predicted 
correctly, he was told “You win a nickel," and 
a nickel was placed in a box near S. In Group 
2, on each trial, E announced the difference 
between the prediction and the actual 
number of 1's and then announced the payoff. 
With both groups the total earnings were 


accumulated on a small adding machine. In 
neither group was S allowed to count his 
total earnings. The S was always given as 
much time as he desired to make his response. 
Average response time was about 5 sec. 

After cach block of 80 trials, it was sug- 
gested to S that he might help E reshuffle the 
slides and place them back in the carriage of 
the projector. This was done to emphasize 
further the random nature of the sequence and 
to eliminate the possibility that S might 
remember short sequences from the preceding 
block and use this information in making his 
guesses. 

After 20 trials, S was asked to estimate the 
percentage of slides containing » 1's, " = 0, 
1---9, He was given a large (60 X 40 in.) 
board with 10 vertical grooves carved in it 
and a container holding 100 marbles. The 
vertical grooves were labeled 0, 1, 2, etc., 
from left to right. The S was reminded that 
each marble represented 1% and that he was 
to distribute the 100 marbles in the grooves 
in such a way that the number of marbles in 
each groove represented what he felt to be 
the percentage of slides with each specific 
number of 1's. Thus, if he felt that 25% of 
the slides had no 1's, he should put 25 
marbles in the vertical groove labeled 0. The 
S was given as long as needed to do this task. 
After S was satisfied with his distribution, E 
recorded the number of marbles in each 
groove and scattered the marbles on the 
board in order to prevent S, the next time his 
estimates were called for, from seeing what 
he had done the previous time. These 
estimates of the proportions of the different 
stimuli were obtained after 20, 50, 80, 120, 
160, 200, 240, and 400 trials. 


TABLE 1 


MEAN ESTIMATE OF P 


AFTER 80 TRIALS AND 


ROPORTIONS OF STIMULUS ELEMENTS (NUMBER OF 1's) 


AFTER 400 TRIALS 


After 80 Trials After 400 Trials 
No. of Slides True 
No. of 1's with 7 1's Proportion 

Group 1 Group 2 Group 1 Group 2 

18 2250 A7 .2306 A750 

0 11 .1006 A167 .1056 
: 8 .0933 .0872 .0994 
2 6 .0872 .0744 .0928 
3 d j 0767 0883 0733 
4 A 0656 0800 (0617 0694 
5 i ‘0583 0733 (0578 ‘0078 
6 E ‘06004 ‘0839 (0033 ‘0839 
7 5 (0706 0789 ‘0789 0928 
1 4672 544 ATM “1400 
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TABLE 2 
Y rHE Two Groups 
zAN RELATIVE FREQUENCY OF RESPONSE FOR THE Two 
MEAN RELATIVE de e BLock or 80 TRIALS 
No. of 1's Guessed 
Blocks * s 3 3 2 5 6 7 8 9 

Group 1 2 | .1208 | .0993 | .0924 | .0792 | .0701 | .0555 | .1264 
1 1424 | 09s | 0889 | 20798 | 0597 | 0586 | .0472 | ‘0528 | 20130 2202 
“3701 | 11076 | {0701 | .0639 | 10431 | .0432 | 10424 | :0549 | ‘o4s7 | “1540 
$ "3024 | 0903 | 10819 | :0590 | .0444 | .0479 | :0320 | ‘0444 | :0528 | ‘1349 
5 4243 | .0819 | 0681 | .0521 | .0444 | .0396 | .0354 | .0417 | 0397 | ‘1598 
Group 2 | osoo | 0639 | .1132 | .1486 | .1479 | .1736 | .1035 | .0792 .0639 | .0472 
5 0819 | 10840 | .1028 | .1236 | 1361 | 1576 | .0819 | :0799 | ‘osi9 | 0702 
3 1028 | :0799 | 1118 | .1292 | .1208 | ‘1660 | ‘0854 | ‘0666 | ‘0556 | 0729 
4 -1042 | .0896 | .1243 | .1354 | .1458 | .1472 | 10764 | ‘0597 | 10549 | '0625 
5 0938 | .0729 | .1444 | .1389 | .1708 | 11396 | '0868 | 0611 | 20217 | ooo 


Finally, after Trial 400, S was asked, 
“If we were to continue and if you were 
allowed only one number as your prediction 
for the remainder of the experiment, which 
number would you select?" After responding, 
Ss were questioned briefly to determine, if 
possible, what kind of cues or what type of 


strategy they employed in performing the 
task. 


RESULTS AND Discussion 


The results of this experiment will 
be described in three sections. The 
first will deal with the relative accu- 
racy of the generalized probability- 
matching model as opposed to the EU 
hypothesis in accounting for the ob- 
served data. The second section will 
be concerned more specifically with 
the evaluation of the EU hypothesis. 
The final section is devoted to a 
discussion of some highly stable 
individual differences found in the 
data. 

Probability-maiching vs. expected 
value.—The mean relative frequency 
of response for the two groups of 18 
Ss for each of the five blocks of 80 
trials is given in Table 2. It is clear 
that in neither group is there evidence 
in the final block of trials of the specific 
probability-matching phenomena. It 


is further obvious that the responses 
of Group 2 (the linear payoff group) 
are not in accord with the generalized 
probability-matching hypothesis. 

A statistical analysis was performed 
to verify this assertion. The sta- 
tistical test employed is described by 
Page (1963). This test requires that 
the model specify a predicted ranking 
of the k(k = 10) treatment means. 
Both the probability-matching model 
and the EU hypothesis predict such 
a ranking. With Group 1, however, 
both make the same predictions of the 
ranking of the relative frequencies of 
response, thus preventing discrimina- 
tion between them in terms of ranks. 
The predicted rankings for Group 2 
are quite different. The probability- 
matching model predicts, as in Group 
1, that the relative frequency of 
response will increase as the proba- 
bility of the corresponding stimulus 
increases. The EU 


; hypothesis, on the 
other hand, predicts that relative fre- 


quency of response will be perfectly 
and positively correlated with the 
expected value of the response. Thus, 
for Group 2, a ranking is predicted by 
the EU hypothesis which is quite 
different from that predicted by the 
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generalized probability-matching hy- 
pothesis. Page's (1963) L statistic 
was computed for the data of Group 2 
for each of the predicted rankings. A 
significant L indicates significant de- 
viation of the data from the null 
hypothesis of no treatment differences 
in the direction of the ranking pre- 
dicted by the model. The L computed 
for the ranking predicted by the 
probability-matching hypothesis is 
5333.0, a value which is not significant 
at the .05 level. The L computed for 
the EU hypothesis ranking is 6000.5, 
which is significant at the .001 level. 
The data are consistent with the EU 
hypothesis and provide clear discon- 
firmation of the probability-matching 
hypothesis for Group 2. 

It might be asserted that since the 
particular probability-matching model 
of Komorita states that S matches his 
subjective probability with the relative 
frequency of response and since the 
two payoff functions may systemat- 
ically and differentially alter S's 
perception of the probabilities, then 
we do not have a direct test of this 
probability-matching model. It was 
partially for this reason that Ss were 
asked periodically to make estimates 
of the relative proportions of stimulus 
events which they had seen. The four 
rightmost columns of Table 1 show 
the mean estimate of the proportions 
of stimulus elements after the first 80 
trials and after the final block of 80 
trials for both groups. The actual 
proportion is shown for comparison In 
the third column. — i 

The differences 1n the mean esti- 
mates after 80 trials and after 400 
trials are too small to account for 
such a large difference between the 


response distributions of the two 
groups. Again, we find that the pre- 
dicted relationship between subjective 
probability and relative frequency © 
response does not receive com rmation 


when the outcomes to Ss are deter- 
mined by a linear payoff function. 

The EU hypothesis —As mentioned 
in the previous section, the L test 
indicated significant (Z = 6000.5, 
p < .001) deviation from equality of 
relative frequency of response for 
different alternatives in the direction 
of the ranking predicted by the EU 
hypothesis. The. test was applied to 
the data of Group 1 using the ranking 
of relative frequencies predicted for 
this group by the expected-utility 
position. Again the results indicate 
significant deviation from the null hy- 
pothesis (L = 6198.5, p < .001) in 
the direction predicted. Thus the 
data from both groups are in accord 
with the EU hypothesis. 

Although it is clear that the distri- 
butions of responses in the final block 
of 80 trials in Table 2 are quite 
different for the two groups, a rather 
weak statistical test was performed as 
verification and to provide an addi- 
tional test of the EU hypothesis. The 
hypothesis states that Ss will tend to 
choose the response for which the ex- 
pected value is greatest. This re- 
sponse is ''0" in Group 1 and “3” in 
Group 2. For each S, the number of 
“0” responses, /(0), and the number 
of “3” responses, f(3), were counted 
in the last block of 80 trials. The 
difference, d = f(0) — f(3) was found 
for each S in both groups, and the 
median of the distribution of d was 
found to be 9.5. The Ss were then 
classified according to whether the 
value of d was above or below the 
median. Of the 18 Ss in Group 1, 16 
were above the median on d while 16 
of the 18 Ss in Group 2 were below the 
median. This distribution of fre- 
quencies yields x? = 16.9, which, with 
df =1, is significant at the .001 level. 
The last source of evidence which is 
directly concerned with the EU hy- 
pothesis comes from S's responses 


80 


when asked which responses he would 
use if he could only use one. Sixteen 
of the 18 Ss in Group 1 selected “0.” 
The other two selected “9.” (Both of 
these Ss believed that there were as 
many “9” slides as “0” slides, as in- 
dicated by their estimates after 400 
trials.) Only one S in Group 2 
selected ''0." Thirteen Ss in Group 2 
chose either "2," "3," or “4,” 3 Ss 
chose "5," and 1 chose "6." (Re- 
sponses 2, 3, and 4 are combined since 
the difference in expected gain be- 
tween guessing ''3" every time for 80 
trials and guessing ‘'2’’—or ''4"—is 
less than 1 cent.) The difference be- 
tween the two groups in the choices 
of a single response is unambiguous 
and provides clear support for the EU 
hypothesis. 

Individual differences.—One of the 
most notable results of this experi- 
ment is that there are very large 
individual differences between Ss in 
performing this task. For example, in 
Group 1, in the final block of 80 trials, 
the range of relative frequencies for 
responding with the most likely alter- 
native, “0,” is from .125 to 1.000. 
The individual differences were much 
more marked in Ss' responses than in 
the obtained estimates, a result which 
certainly would be expected. With 
the estimates, S merely has to state 
his perception of the relative fre- 
quencies of the various environmental 
events. In making guesses, S must 
attempt to execute a strategy for 
maximizing the money he will win. 

In order to investigate the con- 
sistency of these observed differences, 
a value measure, V, was computed for 
each SS for each block of 80 trials. The 
formula for computing V is 

9 
V = © SME), [3] 
i-o 
where i is the response variable going 
from 0 through 9; f(z) is the frequency 
of response 7 in a block of 80 trials; 
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and Z;(g)is the expected gain for one 
response of i. This value is found for 
Group 1, by 


Ex(g) = [4] 


where 7 (i) is the probability of stim- 
ulus event 7, and C is the payoff for a 
correct response, in this case 5 cents. 
For Group 2, 


7 ()C, 


Ex(g) = 20) -j-il, [5] 


where summation is over all stimulus 
alternatives, j = (0, 1---9), d is the 
payoff for a correct guess (where 
j = i), and b is a scaling constant. In 
this experiment, d — 1 cent and 
b = 107', thus a correct choice is 
worth 1 cent, a choice which is off by 1, 
(|j — i| = 1) is worth .9 cents, etc. 

After having computed V for each 
S for each block of 80 trials, the 
correlations of these values between 
all blocks of trials were computed, 
separately for each group. Table 3 
shows the correlation matrices for 
Groups 1 and 2 along with the means 
and standard deviations for each 
block. Each correlation, mean, and 
standard deviation is based on data 
from 18 Ss. 


It is apparent that the differences 
between Ss are very consistent. One 
fact which is of interest is that the 
correlations below the diagonal in- 
crease as the number of blocks in- 
crease. This is especially apparent in 
Group 1. In Group 1 the correlation 
of the value measure between Blocks 
1 and 2 is .77, between Blocks 2 and 3 
it is .89, between 3 and 4, .96, etc. 
This suggests that the individual re- 
Sponse strategies are becoming more 
stable as the number of trials in- 
creases. (The large difference þe- 
tween the groups with respect to SDy 
is a result of the fact that differences 
among E;(g) for the alternatives hav- 
ing greater expected values are much 
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TABLE 3 


CORRELATIONS OF V BETWEEN BLOCKS, WITH M 


SANS AND SDs 


ror Eacu BLOCK ror GROUPS 1 AND 2 SEPARATELY 


Correlations 
Block Xv SDy 
1 2 3 4 5 
Group 1 
1 1.00 3 
2 Wr | 1,00 En 1172 
3 363°" .89** 1.00 59.44 16.28 
4 ,50f* 8/t* .96** 1.00 60.29 15.44 
5 ipo" .g4** .94** .98** 1.00 61.89 15.87 
Group 2 
1 1.00 52.56 1.17 
2 .84** 1.00 51.87 1.79 
3 .64** SIUE 1.00 52.06 1.92 
4 6e EL .91** 1.00 52.37 1.87 
5 '58** '50* A5" "35** 1.00 52.77 1.58 
* 5 <.05. 
** p <01 


greater under the all-or-none payoff 
function than under the linear payoff 
function.) 

To investigate further the consis- 
tency of individual differences the 
uncertainty H of each S's response 
distribution for each block of 80 trials 
was computed by 


H = —2P(i) log P(i), [6] 


where P(i) is the relative frequency 
of responding with response i. This 
measure is related to V but not in a 
simple way. In general, maximum V 
and minimum V depend on H.? (This 


2The mathematical problem of finding 
max Vg and min Vir is that of finding the 
maximum (minimum) of the equation 
9 
Ve = Dairy 
m 
the relative fre- 


where a; = Ei(£) and x; is th f 
; ae e ith alternative, 


quency of response with thé 
subject to the three constraints: 


1.520 
9 
2. Da 1.00 
$70 
9 > 
3. — Zxlogei = K 
i-0 


dependence will be noted by subscript- 
ing V, Vu.) H may be interpreted as 
a measure of the extent to which S 
distributes his 80 responses in a block 
over the set of all possible responses. 
The smaller the value of H, the more 
predictable S's responses. In our 
situation H can be as great as 3.322 if 
S guesses each alternative equally 
often, and as small as 0, if S guesses 
the same alternative for all 80 trials. 

Individual differences between Ss 
with respect to are highly consistent 
throughout the experiment. This is 
demonstrated in Table 4 which gives 
the correlations between H measures 
for all pairs of blocks of 80 trials, along 
with the mean and standard deviation 
of H for each block. The correlations 
for H are of the same order of magni- 
tude as those in Table 3 for V. 

In order to investigate the empirical 
relationship between V and H, the 
correlation between them in each 
block of trials was computed across Ss 
ineach group. These correlations are 
given in Table 5. The fact that all 
of these correlations are negative 
indicates that increments in the value 
of Ss' behavior are accompanied by 
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TABLE 4 
" SANS AND SDs 
^ LA is OF J] BETWEEN BLOCKS, WITH MEANS A ! 
we c7 pede FOR GROUPS 1 AND 2 SEPARATELY 
Correlations Xun SDu 
Block : " 3 4 5 
2.964 385 
Group 1 2} 
1 1.00 2.468 .671 
2 | A "RT. 2:325 ‘948 
3 44 ge “gge* 1.00 2.308 1.006 
4 25 "Bae ‘96** .98** 1.00 2.230 1.063 
3 , 
Group 2 00 2.846 489 
Mgt 1.00 2.569 .120 
2 “our “g7** 1.00 2.572 .197 
3 192** “95** .97** 1.00 2.533 .802 
$ ae ‘o1** .93** .95** 1.00 2.506 .922 
.05. 
5 $o 


decrements in the uncertainty of the 
behavior. These data are interesting 
from the point of view of the EU 
hypothesis, for if the relative fre- 
quency of response with an alternative 
is a nondecreasing function of the 
expected value of the alternative, then 
Vu will be very close to max Vy. It 
follows, in this case, that the only way 
to increase V is to decrease H. In- 
spection of the data reveals that this 
is, in fact, the process which is occur- 
ring. For most Ss the value V is close 
to or equal to max Vy. Thus the 
variation in H is interpreted as being 
responsible for much of the variation 
in V. 
TABLE 5 


CORRELATIONS BETWEEN TOTAL EXPECTED 
GAIN (V) AND RESPONSE UNCERTAINTY 
(H) ror EAcH BLOCK or TRIALS 


Block 
Group : 3 ; : > 
—A7| —87| —92| —97 | —.96 
i m —49 | —:58 | —.57 | —.69 


This interpretation of Ss' behavior 
places emphasis on individual differ- 
ences with respect to H. The quanti- 
tative study of choice behavior and 
decision making has been little con- 
cerned with individual differences and 
as à consequence, scanty information 
is available regarding such differences. 
This fact, taken with the fact that 
these results were unexpected makes 
interpretation of these stable differ- 
ences among Ss with respect to H 
somewhat speculative, 


There is one obvious interpretation, 


however. It may be that these in- 
dividual differences reflect differences 
in Ss’ 


attitudes toward risk and 
gambling. The greater the H, in 
general, the greater the risk. If S 
guesses one alternative, i, all the time 
(H — 0), he guarantees himself that 
he will get Ei(g)-n where Ei(g) is the 
expected value of one guess of the ith 
alternative and x is the number of 
trials. Deviations from an H of 0 
increase the risk associated with the 
behavior. 
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Ss were compared on reversal and non- 
i egenos ege, s dade the learning of the original problem, 
Ee ai jen training on 1 of 3 shift problems: Reversal (R), positive 
T ge sve cues reversed with 2 new irrelevant cues from the old 
Hes 8 dimension introduced; Nonreversal Old (NRO), with previ- 
cusliinclevais cues now relevant and 2 new cues from the old relevant 
dimension introduced as irrelevant; and Nonreversal New (NRN) the 
same as NRO except that the relevant cues were from the old irrelevant 


dimension but new. 


Reversal was significantly easier than either non- 


reversal condition for normals, but there were no significant reversal- 


nonreversal differences for retardates, 
significantly for either group although both found NRO 
more difficult. Retardates were significantly superior to 
in nonreversal and tended to be inferior in reversal. 


Several investigators (Buss, 1956; 
Harrow & Friedman, 1958; Isaacs & 
Duncan, 1962; Kendler & D'Amato, 
1955) have demonstrated that adults 
learn a reversal shift faster than a non- 
reversal shift. For nursery school 
children (Kendler, Kendler, & Wells, 
1960) and for rats (Kelleher, 1956) the 
opposite has been found—a non- 
reversal shift is learned more quickly 
than a reversal shift. In a study with 
kindergarten children Kendler and 
I&endler (1959) divided their Ss into 
a fast learning group and a slow learn- 
ing group on the basis of the learning 
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NRO and NRN did not differ 
slightly 
normals 


of the original problem and found that 
fast learners, like adults, found a 
reversal shift easier while slow learn- 
ers, like nursery school children and 
rats, found a nonreversal shift easier. 
It was suggested that the fast learners 
responded in a manner consistent with 
a mediational S-R theory, while the 
slow learners performed as predicted 
by a single-unit S-R theory. Kendler 
and Kendler proposed that reversal 
would be easier than nonreversal for a 
majority of children past the age of 
about 6. 

While the Kendler and Kendler pre- 
diction was made with respect to 
normal children, results in apparent 
agreement with it were reported by 
House and Zeaman (1962) for re- 


tardates of MA 6-8; i.e., these Ss 


found a reversal problem easier than 


a nonreversal problem. This finding 
is surprising, however, if the postu- 
lated role of mediation processes in 
reversal and nonreversal situations is 
considered. Kendler and Kendler 
(1962), for example, Speak of media- 
tion as involving the implicit or 
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explicit labeling of relevant and ir- 
relevant stimulus elements. Retard- 
ates might be expected to have a 
verbal deficit, and therefore a media- 
tion deficit, when compared to normals 
of similar MA. Even in cases where 
Ss are matched on a verbal task the 
verbal responses of retardates might 
not serve a mediation function as well 
as those of normals, since it has been 
suggested (e.g., Luria, 1957) that the 
retardate deficiency is one of verbal 
control over behavior rather than a 
language deficit per se. If normal 
children learn a reversal shift faster 
than a nonreversal shift due to the 
utilization of mediation processes, and 
if retarded children suffer some sort 
of mediation deficit, then an inter- 
action of IQ group by type of shift 
would be expected. That is, the 
retardates’ reversal-nonreversal per- 
formance should be more similar, or 
performance should be reversed with 
nonreversal easier, when compared to 
the reversal-nonreversal learning of 
normals. 'The House and Zeaman 
results are not consistent with this line 
of reasoning since their MA 6-8 re- 
tardates performed as Kendler and 
Kendler would expect normals of that 
MA to perform. However the House 
and Zeaman results are difficult to 
interpret in this respect due to the 
absence of a normal comparison 
group, and procedural differences from 
studies using normals. In addition Ss 
in their study received 125 over- 
training trials on the original problem, 
a factor which could have served to 
retard nonreversal shift (Bensberg, 
1958; Mackintosh, 1962). 

The present study compared nor- 
mals and retardates of similar mental 
age on reversal and nonreversal shift 
problems. 

METHOD 


Subjects —The Ss were 48 normal third- 
grade children and 48 mentally retarded 


children. Normals and retardates were of 
similar MA as determined by the Peabody 
Picture Vocabulary Test. Normals were from 
two public schools, retardates from a special 
public school summer session and Southern 
Wisconsin Colony. 

Apparatus,—The apparatus was a modi- 
fied Wisconsin General Test Apparatus, 
303 in. high and 24 in. wide, with 144-in. 
wings slanting away from the front of the 
apparatus at an angle of 120°. Stimuli were 
introduced on a white plastic tray which was 
mounted on wheels and could be presented 
to S through a 7-in. high door below a one-way 
mirror. The door was closed between trials. 
Two circular hollows, 53 in. apart, served as 
reward cups. Stimuli were slid over the cups 
in grooves on the tray in such a way that S 
was unable to lift a stimulus from the tray, 
but had to slide it away from him to expose 
the reward cup. The reward was a ''cocoa 
puff” (General Mills’ cocoa flavored cereal). 

Stimuli were ł-in. high wooden blocks 
mounted on 4 X 4% in. rectangles of white 
plastic. The shapes were not matched for 
area or volume, but were constructed so that 
there were margins of approximately } in. 
between the edges of the shape and the edge of 
the plastic base. Forms were an equilateral 
triangle, a circle, a square, and a T. The 
triangle was presented with the apex toward 
S. Colors were red, blue, green, and yellow. 
In the original problem the red and the blue 
circle and triangle were designated Set 1 
stimuli, and the green and the yellow square 
and T Set 2 stimuli. Four different stimuli 
composed of two forms and two colors were 
used for each S. These four stimuli were 
presented in pairs which varied simultane- 
ously in form and color. (For example, for 
Set 1 stimuli the red circle was paired with the 
blue triangle for half the trials and the red 
triangle with the blue circle for the other half.) 

Design.—The design was essentially a 
2X3X2 X2 X2 X 2 factorial involving 
two IQ groups (retardates and normals), three 
conditions (R, NRO, and NRN), two dimen- 
sions in the transfer problem (color and form), 
two cue sets within each dimension, positive- 
negative (each cue of each set was positive 
once and negative once), and two sexes. 
Each of the 96 cells was filled by one S. Sex 
and positive-negative were often confounded 
in an effort to keep condition subgroups 
balanced with regard to MA, institutionaliza- 
tion, trials to criterion in the original dis- 
crimination, and number of Ss requiring 
special training. There were 10, 9, and 11 
institutionalized retardates in Retarded Sub- 
groups R, NRO, and NRN, respectively. 
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Following a preliminary examination of the 
data only groups, conditions, and dimensions 
were used in the analysis, with all other 
factors collapsed. Thus for analysis there 
were 12 cells with 8 Ss per cell. : 
After original learning each S was assigned 
to one of the following shift problems: Re- 
versal (R): The previously negative cue be- 
came positive and the previously positive cue 
became negative. Two new irrelevant cues 
from the old irrelevant dimension replaced the 
old irrelevant cues. Nonreversal Old (NRO): 
The previously irrelevant cues became rele- 
vant, one positive and one negative. Two 
new cues from the old relevant dimension 
were introduced as irrelevant. Nonreversal 
New (NRN): The new cues from the previ- 
ously irrelevant dimension became relevant, 
one positive and one negative. Two new cues 
from the old relevant dimension were intro- 
duced as irrelevant. 

Two features of these conditions are note- 
worthy: (a) All conditions had new irrelevant 
cues in the shift problem, thus giving each S 
a “cue” to the change in the stimulus-reward 
contingencies and (b) the old relevant cues 
were removed for both nonreversal shift 
conditions thus precluding perseveration to 
the old positive cues. 

Procedure.—Before the start of training S 
was given four trials without the stimulus 
blocks. For these trials the reward was placed 
alternately in the right and left reward cups. 
The baited cup was covered slightly more on 
successive trials with a white plastic rectangle 
that was identical to the stimulus bases. The 
last trial was given with the reward cup com- 
pletely covered. On each trial S was required 
to slide back the cover, if necessary, and take 
the reward. Immediately after these four 
trials S was given the following instructions: 


Now I'm going to show you four blocks, 
twoata time. If you push back the right 
block there will be a cocoa puff under it; 
if you push back the wrong block there will 
be nothing under it. You may choose only 
one block each time. I want you to try to 
get as many cocoa puffs as you can. 


Discrimination training was then started. 

The order of presentation of the pairs of 
stimuli as well as the position of the positive 
stimulus on each trial followed a Gellerman 
sequence. A noncorrection method was used, 
and S was required to learn the problem to a 
criterion of 10 consecutive correct responses. 
If .S failed to meet his criterion within 80 
trials he was given the special training 
described below. However, if .S's last 7 
responses in the 80 trials were correct he was 


given 3 additional trials to meet criterion. 
If .S then failed to meet criterion after 83 
trials the special training described below was 
given on the following day. Immediately 
after S met criterion on the original problem, 
and with no further instructions, he was given 
training on either R, NRO, or NRN. This 
training continued for 80 trials or until S5 met 
a criterion of 15 consecutive correct responses. 

In order to avoid artifacts that would be 
produced by a high percentage of nonlearners, 
Ss who failed to learn the original problem 
within 80 trials were given special training on 
the following day. The E first showed the 
positive stimuli to S saying, “These blocks are 
always right. There will always be a cocoa 
puff under these blocks." Then the negative 
stimuli were presented with the words, ‘These 
blocks are always wrong. There will never 
be a cocoa puff under these blocks." Training 
then proceeded as usual. Four Ss who did not 
meet the criterion within the subsequent 20 
trials were dropped from the study. 

Two Ss not available for training on the 
second day (e.g., absent from school) were 
discarded. 


RESULTS 


The mean MA was 124.4 mo. for 
normals and 115.5 mo. for retardates. 
Although the groups differed some- 
what with regard to mental age they 
were quite similar with respect to 
mean OL errors (23.86 for retardates ; 
23.40 for normals). Mean IQ was 
70.7 for retardates and 110.4 for 
normals. Matching within groups on 
the basis of MA, IQ, and OL errors 
was such that in no case did the R, 
NRO, or NRN means within either 
group differ from the appropriate 
overall mean by more than one stand- 
ard error. Assignment of special 
training Ss resulted in 6, 9, and 7 
normals; and 7, 5, and 7 retardates in 
Groups R, NRO, and NRN 
spectively, 

The mean number of errors made in 
reversal and nonreversal js shown in 
Fig. 1. A preliminary examination of 
the data revealed no main effects or 
interactions due to sex of S, and this 
factor was dropped from further 
analyses. A three-way analysis of 
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Fic. 1. Mean number of errors made by 
normals and retardates during reversal (R) 
and two nonreversal shift conditions (NRO 


and NRN). 


variance of the error data involving 
groups (normal and retarded), condi- 
tions (R, NRO, and NRN), and 
dimensions (color and form) is sum- 
marized in Table 1. This table shows 
that, across groups, reversal Ss made 
significantly fewer errors than non- 
reversal Ss at the .01 level, with no 
significant difference between the two 
nonreversal conditions. The Groups 
X Conditions interaction was signifi- 
cant only in the comparison of 
reversal and nonreversal (R vs. NR). 
Figure 1 shows that the interaction 
arises from a marked decrement in the 


TABLE 1 
ANALYSIS OF VARIANCE OF ERRORS 
Source df MS F 
Credo HE ced eed 
Ponaltions (od 1 | 2310.19 | 10.19** 
NR A NRN 1 | 529.00 | 2.33 
DENM 1 | 1837.50 | 8:11% 
aegis om) 2 | 1044.03 
5 ** 
G XR vs. NR 1 | 2028.00 | 8.9. 
cS € NRO vs. NRN 1 60.06 27 
D X e 2 | 340.72 
D 1 | 60919 | 2.69 
X RRO vs 72.25 03 
D € NRO vs. NRN 1 1225 
GxCXD 3 n n" 
G XD XR vs NR 1 s 1 
M bd 1 | 17556 7 
Within s4 | 226.63 
Dp <.01 
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normals’ performance on either non- 
reversal problem. Color was a signif- 
cantly more difficult dimension than 
form (p < .01) but there were no 
significant interactions of the dimen- 
sion factor with other variables. 

Subsequent Duncan range tests re- 
vealed that the normal reversal group 
made significantly fewer errors than 
either normal nonreversal group 
(p « .01), but that there were no 
significant differences among the re- 
tardate groups. Duncan range tests 
were also used to compare normal and 
retardate performance under R, NRO, 
and NRN conditions. Only the NRO 
test reached the .05 level, although 
the NRN comparison approached that 
level. 


DISCUSSION 


The results of Ythis study confirm 
the Kendler and Kendler (1959) predic- 
tion that a reversal problem is sig- 
nificantly easier than a nonreversal prob- 
lem for a majority of normals past the 
age of 6. This prediction, however, can- 
not be extended to include retardates 
since the present study found no sig- 
nificant difference between reversal and 
nonreversal learning in retardates with a 
mean MA of approximately 9-8. These 
results are not in agreement with those of 
House and Zeaman, who found reversal 
learning significantly easier than non- 
reversal learning for retardates of MA 
6-8, but are consistent with the notion 
that the mediation processes, in which 
the retarded are presumably deficient, 
are important in reversal-nonreversal 
situations. As previously mentioned, the 
use of an overlearning procedure may 
have been an important factor in the 
House and Zeaman results, although 
other procedural differences between the 
House and Zeaman study and the present 
experiment make consideration of the 
different results speculative. 

g In the present study there was no 
significant difference between NRO and 
NRN nonreversal problems for either 
normals or retardates, but both groups 
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found the NRN problem slightly. casicr. 
Familiarity with, or "learning to ignore 
irrelevant cues may retard the sub- 
sequent learning of these cues to some 
degree, although for normals the reversal- 
nonreversal factor was of much more 
importance than the NRO-NRN proce- 
dure difference. . 

The present study indicates differences 
in the learning processes of normals and 
retardates of approximately the same 
mental age. While the normals' per- 
formance was such as to suggest that 
mediation processes were utilized (re- 
versal easier than nonreversal), the re- 
tardates' performance did not indicate a 
similar involvement of mediation. How- 
ever, if the retardates had been com- 
pletely unable to utilize mediational 
processes, reversal should have been 
more difficult than nonreversal, as has 
been found in studies with young children 
and rats. These results might, of course, 
be due to the presence in the retardate 
group of some Ss who utilized mediation 
Processes and some who did not. Such 
an interpretation of similar reversal- 
nonreversal results was proposed by 
Kendler and Kendler (1959), who then 
related the mediating behavior of their 
Ss to performance on the original prob- 
lem. A comparison of the shift perform- 
ance of fast and slow learners was not 
possible with the present paradigm since 
errors in original learning could not be 
separated from a dimension preference, 
If original learning errors reflected the 
chance assignment of Ss to preferred or 
nonpreferred dimensions—the fast learn- 
ers having been assigned to their pre- 
ferred dimensions and the slow to their 
nonpreferred dimension—errors in orig- 
inal learning would be expected to 
correlate positively with reversal shift 
errors and negatively with nonreversal 
errors. Correlations between original 
learning errors and shift errors were in 
these directions. The S's “degree of 
mediation" could not be related to MA, 


IQ, or CA since all of these measures 

failed to correlate significantly with 

number of errors on the nonreversal shift 
problem. 
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INCREASING CREATIVITY BY FREE-ASSOCIATION 
TRAINING 


JONATHAN L. FREEDMAN ! 


Stanford University 


An experiment w 


as conducted to test the hypothesis that facilitating 


S's ability to produce associations would increase his score on a test of 


creativity. 
the words. 
cantly higher on the 


Ss either free associated to 10 stimulus words or defined 
"Those receiving the free-association training scored signifi- 
Remote Associates Test (RAT), a test of creativity 


which was administered after the training session. 


Maltzman (1960) has proposed that 
creative thinking consists of bringing 
together various associations which 
are novel but are also appropriate to 
the task. To test this idea he and his 
associates have conducted a series of 
studies (Maltzman, Bogartz, & Bre- 
ger, 1958; Maltzman, Simon, Raskin, 
& Licht, 1960) in which Ss free 
associate to a series of stimulus words 
under a variety of conditions and then 
take the Unusual Uses Test (Guilford, 
1950). The one procedure which has 
been consistently successful in im- 
proving performance on the Unusual 
Uses Test (as compared with control 
groups) is presenting the stimuli a 
number of times and requiring Ss to 
give a different response to each 
stimulus on each presentation. 

A recent experiment by Caron, 
Unger, and Parloff (1963) indicated, 
however, that this training procedure 
did not affect performance on the Re- 
mote Associates Test (RAT). Since 
Mednick (1962) has shown that Ss 
who have a relatively large number of 
associations to a given stimulus tend 
to score higher on the RAT than those 
who have fewer associations, a train- 
ing technique which facilitated free 
associating should improve perform- 
ance on the RAT. The failure of 
Maltzman's procedure may have been 


1 The author is grateful to Zeta Brown 
who assisted in the running of this study. 
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due to the fact that it is designed to 
increase the production of unusual 
associations, whereas the RAT is 
constructed so that the correct re- 
sponses are commonly associated with 
the stimulus words, and are often 
dominant associates (e.g., MOUSE- 
CHEESE). Since it appears that suc- 
cess on the RAT depends upon the 
production of many associations in a 
short time, the present study at- 
tempted to improve performance on 
the RAT by a training technique 
specifically designed to facilitate this 


process. 
METHOD 


Subjects —The Ss were 90 students from an 
introductory psychology class at Stanford 
University. There were 20 males and 20 
females in each of the two main experimental 
conditions, and an additional 10 males in a 
control condition. The Ss were run in- 
dividually. 

Procedure—All Ss were told that they 
were going to take a test designed to measure 
creativity and that before the test they would 
receive a brief warm-up exercise. The 
facilitation group was instructed to give their 
associations to each of 10 stimulus words. 
Each word was read aloud by E and S was 
told to give whatever words came into his 
mind, and to continue until told to stop. 
They were stopped after 30 sec. or when they 
had stopped responding. The nonfacilitation 
group was asked to define each of the words. 
The same words were used for both groups, 
and were chosen so that neither the words 
themselves nor any of their common asso- 
ciates were the correct responses to any of the 
items on the creative thinking test. The 
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stimulus words were: BOOK, MOON, COPPER, 
MOVIE, BRASS, COFFEE, READ, FARM, TO- 
GETHER, STAR. . 

An additional control group consisted of 
10 male Ss, each of whom was paired with 1 
male S in the facilitation condition. The 
procedure was identical to that in the facilita- 
tion condition except that the control Ss read 
from a card the associations which had been 
given by the paired experimental S. Thus, 
in each pair both Ss had verbalized the same 
words, but the facilitation S had produced 
them himself, whereas the control S merely 

hem. 

€ creative thinking test was the Remote 
Associates Test (RAT). This test is com- 
posed of items consisting of three words, and 
S's task is to find a fourth word which is 
somehow associated with each of the three 
stimulus words. The test consists of 30 items 
and takes 40 min. It was presented in two 
equal parts of 15 items, and the original 
warm-up exercise was repeated after the first 
part of the test. Scores on the test consist 
simply of number correct. (For a more com- 
plete description of the RAT see Mednick, 
1962.) 

A pretest similar to the present study 
indicated that the experimental manipulation 
was more successful with men than with 
women. Since the pretest was run entirely 
by a male E, it seemed likely that female Ss 
were constrained in free associating in the 
presence of a male E. Therefore, in the 
present study all men were run by a male E 
and women by a female E. 


RESULTS AND Discussion 


The major results are presented in 
Table 1, which shows the score on the 
RAT for facilitation and nonfacilita- 
tion groups and for male and female 
Ss. The means for the complete test 
are presented because the results are 
similar for the two halves of the RAT. 
For both men and women the facilita- 


TABLE 1 
MEAN Scores on RAT 


Males Females 
Facilitation 18.058 18.60 
Nonfacilitation 14.15 17.10 


a Maximum score is 30. 
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TABLE 2 


ANALYSIS OF VARIANCE OF MEAN 
Scores on RAT 


Source df F 
Facilitation-Nonfacilitation (F-NF) 1 9.82** 
Male-Female (M-F) 1 4.13* 
F-NF X M-F 1 1.94 
Error (MS) 76 | (14.84) 

* 5 <.05. 
** p <.01 


tion group achieved higher scores than 
the nonfacilitation group. As may be 
seen in the accompanying analysis of 
variance (Table 2), the difference 
between facilitation and nonfacilita- 
tion groups is significant. Separate 
comparisons using the combined error 
term indicated that the difference is 
also significant for males alone; 
F (1, 76) = 10.25, p «.01, but not 
for female Ss, F (1, TOY = 1.59. 

It is difficult to determine why the 
effect is so much stronger for male 
than for female Ss. There is, of 
course, a confounding of E and sex of 
Ss, but an extensive pretest using a 
single E produced essen tially the same 
pattern of results. There were no 
appreciable differences in the number 
of associations produced by the two 
groups during the training session. It 
is possible that the difference is due to 
a ceiling effect for female Ss since the 
overall means for females are signifi- 
cantly higher than for males. 

_ It could be argued that the facilita- 
tion Ss scored higher on the RAT be- 
cause during the training session the 
former produced a large number of 
associations to the stimulus word 
whereas the latter merely defined it. 
Although the associations were not 
the exact responses necessary for the 
RAT, they may have indirectly 
strengthened the correct responses 
through the process of mediated gen- 
eralization (Mednick & Freedman, 
1960). The control group was run to 
assess this possibility, 
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The control Ss had available to 
them the same body of words that 
were available to the facilitation Ss, 
but the former had not actually free 
associated. If the availability of the 
words were the critical factor, the con- 
trol group should have scored as high 
as the facilitation group on the RAT; 
if the act of associating were critical, 
the control group should score less 
high and should not differ from the 
nonfacilitation group. The latter 
result was found. The control group 
mean was 14.56 which was signifi- 
cantly lower than the paired facilita- 
tion group as indicated by a test for 
correlated means; t (9) = 2.31, p < .05, 
and did not differ appreciably from 
the nonfacilitation group. Thus, it 
appears that the actual act of associa- 
ting is the critical factor which facili- 
tated performance on the RAT. 

Why did this training procedure 
have a facilitating effect when Maltz- 
man's technique did not? The ex- 
planation does not seem to be a 
difference in the amount of training 
since the present procedure seems to 
have given less training than the 
standard Maltzman technique. The 
number of responses per stimulus was 
fewer in the present study (a mode? of 
three vs. six in Caron's study) ; fewer 
stimuli were used (10 vs. 25) ; and the 
total time taken by the training was 
considerably less (about 5 min. vs. 
about 20 min.). 

There was probably a difference in 
the remoteness of the associations 
ing training in the two 
Unfortunately, it is im- 
the remoteness of 
n in the present 


given dur 
studies. 
possible to assess 
the responses give 

2 The mode is probably the most descrip- 
tive measure of central tendency since a large 
majority of Ss gave exactly three responses 
to each stimulus but a few gave a Very large 
number of responses. 


study since no appropriate norms 
exist. The Ss did not give a number 
of responses to the same stimulus, but 
were giving a series of associations 
with, in most cases, only the first one 
being specifically to the stimulus word. 
It may be that this gives Ss practice 
associating to several stimulus words 
ata time, which is what he must do 
on the RAT. 

Another possibility is that, as dis- 
cussed earlier, the present technique 
causes Ss to produce a number of 
associations in a brief period whereas 
the Maltzman method requires only 
one association at a time. Since Ss 
probably solve the RAT problems by 
trying to produce many associations 
quickly, the present training tech- 
nique is more similar to the process of 
solution than is Maltzman's; and the 
former might therefore be expected to 
have a greater facilitating effect than 


the latter. 
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PROBABILITIES ! 
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Ohio State University 


2-choice prediction behavior was studied for a task in ah Std 
probabilities were conditional upon values of the stimuli. S was re- 
quired to guess which of 2 meter readings would be higher given one as 
a stimulus; readings were drawn in pairs from 2 separate distributions. 


4 groups of Ss made 100, 300, 500, or 700 


predictions, following which 


they judged distribution means and various probabilities associated 


with the readings. 


accurate. 


The manner in which S deals with 
the classical, two-choice, probability- 
learning situation is fairly well es- 
tablished at an empirical level (Ed- 
wards, 1961; Neimark & Shuford, 
1959; Restle, 1961; Siegel & Gold- 
stein, 1959). Although disagreement 
exists as to whether the underly- 
ing mechanisms involve probability 
matching, schema formation, extreme- 
asymptote generalization, expected- 
utility maximization, or some com- 
bination thereof (Edwards, 1961; 
Estes, 1950; Grant, Hake, & Hornseth, 
1951; Restle, 1961; Siegel, 1959), it is 
apparent that S can—and under some 
conditions does— produce response fre- 
quencies which parallel closely the 
probabilities of alternative stimuli. 


l'This research was carried out in the 
Laboratory of Aviation Psychology and was 
sponsored by the Advance Research Projects 
Agency, Department of Defense, under Con- 
tract No. AF 33(657)-11718 with the Ohio 
State University Research Foundation, and 
monitored by the Aerospace Medical Division, 
Air Force Systems Command, United States 
Air Force. Permission is granted for re- 
production, translation, publication, use, and 
disposal in whole or in part for any purpose 
of the United States Government. 
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Asymptotic prediction frequencies did not match 
conditional probabilities even though judg 
Prediction functions suggest th 
for each stimulus distribution beyond whi 
meter. Empirical cutoff data support th 
further, that the location of cutoff point 
means as knowledge is gained over trials. 


ments were surprisingly 
at S establishes cutoff points 
ch he chooses the alternative 
his interpretation suggesting, 
S approaches the distribution 


Such probability matching has even 
been reported for situations in which 
the choice required "probabilistic dis- 
crimination” (Estes, Burke, Atkinson, 
& Frankmann, 1957), a finding which 
supports nicely the model proposed by 
Burke and Estes (1957). The basis 
for discrimination in this study was 
the frequency with Which specific 
stimulus elements (lights) appeared in 
samples drawn from a set of 12 such 
elements. Therefore, any given stim- 
ulus sample was representative of one 
of two sets of sampling rules and each 
response was reinforced in accordance 
with the rule in effect at the time. 

A slightly different problem involv- 
ing probabilistic discrimination is one 
in which reinforcement is administered 
on the basis of direct comparisons of 
individual stimulus elements (herein 
termed values) representing the two 
sampling processes. The present 
study employs a task which permits 
the investigation of this problem. Ag 
in the case of the Estes et al. study, 
the basic difference between this and 
the classical task is that the alterna- 
tive stimuli (A and B) assume any of 
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a number of values rather than merely 
occurrence vs. nonoccurrence on any 
given trial. Here, however, S ds 
provided with the value (reading) for 
only one stimulus and is required to 
guess whether it or the other stimulus 
(not given) has the higher reading. 
On successive trials, pairs of A and B 
readings are drawn randomly from 
separate A and B populations; thus, 
S can benefit from past experience by 
formulating increasingly reliable esti- 
mates of the distribution parameters. 

The present task permitted the in- 
vestigation of several related prob- 
lems. The first objective was to 
determine the extent to which asymp- 
totic frequencies of choice at each 
value of A and B match their condi- 
tional probabilities—i.e., the proba- 
bilities of A or B being greater given 
(a) a specific value of A, or (b) a 
specific value of B. As in the classical 
problem, such behavior would not 
tend to optimize correct guesses, al- 
though it would be precisely the be- 
havior predicted by Estes’ (1950) 
theory. Evidence for conditional 
probability matching is, in fact, 
provided by the Estes et al. (1957) 
study (see Fig. 3). A second objective 
was to determine the manner in which 
behavior changes over trials; since the 
present task involves a large number 
of conditional statements rather than 
a pair of unconditional ones, it would 
seem that asymptotic performance 
would be reached more slowly than in 
the classical situation. A final ob- 
jective was to determine, insofar as 
possible, what it is that S acquires 
over trials. Neimark and Shuford 
(1959) broached this issue for the clas- 
sical problem by attempting to relate 
estimates of probability to choice 
frequencies. Unfortunately, the oper- 
ation of estimating appeared to influ- 
ence that of predicting SO that the 
results on this point were inconclusive. 


In the present study the approach was 
to use separate groups for the various 
(four) levels of practice with each 
group making estimates following the 
completion of their trials. As will be 
discussed below, estimations were 
obtained for a number of distribution 
characteristics. 


METHOD 


Subjects —The Ss were 24 male and 16 
female undergraduate students enrolled in an 
introductory psychology course. They were 
divided at random into four groups of 10 each 
(6 males and 4 females). 

Apparatus.—The display consisted of a 
10 X 24 in. panel upon which was mounted a 
pair of microammeters with scale markers 
reading from 0 to 200 pa. in units of 5. The left 
meter was labeled A and the right one B. The 
meters could be set independently to any 
desired reading by adjusting potentiometers 
in circuits Írom a power supply to each meter. 
A switch in each circuit afforded E manual 
control over the presentation of readings on 
one or both meters. Another pair of switches 
was located beneath the display panel for use 
by S in making his response. His choice of 
meters was indicated to E by an A or B light 
which was activated by the A or B response 
switch. In addition, either response switch 
made the circuits from the power supply to 
both meters so that S, in responding, would 
get feedback from both meters (i.e., he would 
see both the A and the B readings for that 
trial). 

The S was seated at a table upon which 
were mounted the display and corresponding 
response keys. A partition separated him 
from E whose apparatus was located on the 
table behind the partition. On each trial E 
made settings on both potentiometers, gave 
a verbal ready signal to S, closed the switch to 
one of the meters (i.e., presenting a reading), 
and waited for S’s response. The S then 
pressed either the A or B response switch 
according to which meter he believed was set 
to the higher reading. His choice was 
signaled to E by a light and the readings on 
both meters were displayed as feedback. 
Release of the switch by S terminated the 
trial. Approximately 4 sec. intervened be- 
tween trials during which time responses were 
recorded and settings were made. 

Procedure.—AÀs noted above, the task re- 
quired S to guess which of the two meters on 
his display had the higher reading given the 
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A 


Probability of Occurrence 


B 


24 28 32 36 40 


Reading 


Fic. 1. An illustration of the distributions of readings sampled on 


reading for only one of them (the stimulus 
reading). On successive trials the stimulus 
reading appeared alternately on Meter A and 
Meter B so that S would have equal oppor- 
tunity to sample both meters. After each 
response, of course, S was presented both 
'eadings (A and B) as knowledge of results. 
The readings to be used on any trial were 
drawn at random and independently from two 
10rmal populations: one from the A popula- 
ion and one from the B population. These 
opulations of readings were defined, as 
hown in Fig. 1, with means of 19 and 25 and 
"s of 6 units (hence the means differed by 1¢). 
nstructions contained no mention of the 
xistence or nature of the underlying popula- 
ions; a major purpose of the study was, in 
ict, to determine how much S learned about 
iem and to what extent he reflected his 
nowledge in his behavior. One restriction 
laced upon the sampling procedure was that 
iere be no tied readings; whenever this oc- 
irred, that pair was discarded and a new 
iir was drawn. All pairs for a given session 
ere entered on program sheets for E's use in 
anual presentation of the readings. 
Trials were administered in blocks of 50 
th summary feedback (number of errors) 
ovided after every 100 trials. Groups were 
fined by the number of trials performed: 
oup I—100; Group II—300; Group m= 
); Group IV—700. In each daily session 
e the first, 200 trials were completed; in 
first session only 100 were administered. 
lowing the completion of all trials, each S 
; required to estimate (a) the average 


Meter A and Meter B. 


adings from each 
distribution would occur, (c) the number of 


(or B) would read 


ay. 
ons for this portion of the study 


to read and were 


RESULTS 


The statistical nature of the task, 
as illustrated in Fig. 1, was such that 
S could optimize correct predictions 
by adopting the following rule: choose 


| only if A, when presented, reads 
higher than the 


, choose B. The 

performance ap- 
mum can be esti- 
ting the empirical 
cutoff points at which S switches his 
prediction from the meter on which a 
reading is presented to that on which 
it is not. That is, an estimate of the 
A distribution cutoff is defined as the 


extent to which 
Proaches this opti 
mated by calcula. 
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point midway between the highest B 
value at which A is chosen and the 


lowest B value at which B is chosen; 


for the B distribution, it is midway be- 
tween the highest A value at which 
B is chosen and the lowest A value at 
which A is chosen. Such cutoff points 
were determined for all groups on the 
last 100 trials, and the means are 
shown in Fig. 2. One set of curves in 
the figure reflects the performance of 
different groups at various levels of 
practice, whereas the others are based 
upon a single group (IV) at the same 
levels. The optimum cutoff points, 
ie., distribution means, are also 
plotted for purposes of comparison. 
Over the 700 trials it is clear that the 
cutoff points diverge and approach an 
asymptote near—but not on—the 
optimum levels. A 1 test revealed 
that the difference between A and B 
cutoffs for Group IV at 700 trials is 
highly significant (P < .01). The on 
for the B cutoff scores (Group 1V— 
700 trials) was .47 and for the A cutoff 
scores was .62. Therefore, the B dis- 
tribution mean lies within the .05 
confidence interval of the B cutoff 
scores, whereas the A distribution 
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B Meon 


Cut off Point 
Group IY Ao----O 


22.0 


Stimulus Reading 
n 
o 


Y 
o 
o 


300 500 T00: 


Trials 
Fic. 2. Mean cutoff points obtained for 
the two distributions plotted for a single 
£roup and the four separate groups after 
comparable periods of practice. 
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A Meon (Actual) 


Estimates of Distribution Means 


300 


Trials 


Fic. 3. A comparison of mean estimates 
with actual mean values for both distributions 
at all levels of practice. 


mean lies outside the .05 confidence 
interval for the A scores. This means 
that B cutoffs cannot be regarded as 
significantly different from their op- 
timum level whereas A cutoffs can. 

Figure 3 summarizes the average 
estimates of distribution means for A 
and B readings made by Ss at varying 
levels of practice. Here, of course, 
trials and groups are completely con- 
founded. An analysis of variance 
carried out on these data indicated 
that the A vs. B effect is significant 
(p < .01) but the practice (group) 
effect is not (p > .05). This suggests 
that S develops a fairly good estimate 
of thetwo means in 100samples (trials), 
which estimate improves little—if at 
all—over the next 100 trials. 

In contrast to mean judgment, 
estimates which are intended to reflect 
Ss' knowledge of the form of the A and 
B distributions were too unreliable to 
be of any use. The responses of some 
Ss suggested that they judged, instead 
of the probability of occurrence of vari- 
ous readings, the conditional proba- 
bility of a meter being correct given 
these readings. Such judgments were 
in marked contrast to those of other 
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Ss who apparently interpreted the in- 
structions correctly. 

The extent to which S responds in 
accordance with conditional proba- 
bilities is described in Fig. 4. Here, 
the response proportions for the B 
meter are compared with the proba- 
bilities of the B meter reading higher 
when various A and B readings are 
presented. In order to obtain more 
reliable estimates of both probabilities 
and proportions (especially for low- 
probability events), the data are 
grouped over Ss and over limited 
ranges of readings. Response propor- 
tions are based upon the last 100 trials 
for each group and conditional proba- 
bilities upon the A and B stimulus 
populations. To simplify the pre- 
sentation, only the data for Group I 
and Group IV are included; those for 


1.00 


© 
o 


Proportion of B Responses 


20 


00 .20 40 


Conditional 


Fic. 4. 


trials) compared. 


Proportion of times Meter B was chosen over Meter A 
with different conditional probabilities of B > A; Group I (100 trial. 


the other two groups lie well within 
these limits and are very similar in 
form to Group IV. It is clear that 
none of the functions in Fig. 4 are 
described adequately by the diagonal 
line which represents the probability- 
matching hypothesis. The data for 
100 trials (Group I) are obviously 
closer to this hypothesis than those 
for 700 trials (Group IV); with 
practice, apparently, the function 
becomes increasingly sigmoid. In 
neither group do there appear to be 
any systematic differences attribu- 
table to the stimulus. It should be 
noted, in regard to these functions, 
that the maximum number of correct 
guesses would be obtained if the pro- 
portion of B choices dropped directly 
from 1.00 above an actual probability 
of .50 to .00 below this point. Such a 


Group Stim. 
I 


I 
p 
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given A or B readin 
s) and Group IV (700 
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1.00 
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60 
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Proportion of B Judged Greater 
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60 80 1.00 
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Fic. 5. Median proportion of times Meter B was judged greater than Meter A given 
readings with various conditional probabilities of B > A; Group I (100 trials) and Group IV 


(700 trials) compared. 


function would result if S were to 
adopt the strategy described earlier in 
which cutoffs are located at the A and 
B distribution means. 

Turning to the data which directly 
reflect knowledge gained over trials, 
Fig. 5 indicates that S does, indeed, 


Predicted 


700 


o 100 300 500 
Trials 
median 
ed, was 
r than 


Fic. 6. A comparison of the 
proportions of times Meter B was judg 
predicted, and was actually greate! 
Meter A at four levels of practice. 


come to achieve a fairly good under- 
standing of the conditional proba- 
bilities involved for the various stim- 
ulus readings. Again, data are plotted 
only for the extreme levels of practice 
(100 and 700 trials), and it is clear 
that judgments are far closer to 
actual probabilities on Trial 700 than 
on Trial 100. On the earlier trials, S 
appears to underestimate high proba- 
bilities and overestimate low ones. 
Even after 700 trials there appears to 
be slight tendency to underestimate 
high conditional probabilities. 

The last judgment required of S 
was that of the simple probability of 
the B meter reading higher than the A 
meter (irrespective of stimulus). As 
shown in Fig. 6, these judgments im- 
proved between 100 and 300 trials but 
little thereafter. An interesting par- 
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allel is shown in this figure between 
probability judgments and propor- 
tions of B choices on prediction trials. 
Both functions approach an asymp- 
tote after 300 trials at a level very 
close to the actual proportion of times 
B is greater than A. 


DISCUSSION 


The present results indicate that 
asymptotic response frequencies do not 
match conditional probabilities associ- 
ated with the stimuli in a complex 
version of the two-choice guessing game 
and, further, that the discrepancy be- 
comes greater as S gains knowledge of the 
probabilities involved. It is equally clear 
that this behavior is not attributable to 
S's failure to learn the statistical param- 
eters of the task: the data reveal that he 
judges the mean, the simple B > A prob- 
ability, and even the conditional prob- 
abilities (B > Ala, B > Alb, A > Bla, 
A> B|b) with considerable accuracy 
after several hundred trials. He simply 
prefers, it seems, to make his choices on 
some basis other than the maladaptive 
probability-matching principle. The na- 
ture of his strategy, while not revealed 
unequivocally by the data, appears to be 
one intended to maximize the number of 
correct guesses: i.e., choose a stimulus 
reading for both meters below which you 
will predict the alternative meter to be 
greater; set these points at readings for 
which the conditional probability of suc- 
cess is .50 (in this case, the mean of the 
distribution of readings on the other 
meter). Such a strategy is essentially 
that proposed by statistical decision 
theorists to account for a wide range of 
behavioral phenomena (Swets, Tanner, 
& Birdsall, 1961). Figure 4 indicates 
that asymptotic choice of the B meter 
does, indeed, approach 1.00 as the con- 
ditional probability of B > A increases 
above .50. That it does not actually 
become a step function, as would be the 
case if S were to maximize correct 
guesses, is partially due to the fact that 
data are combined over Ss. So long as 
there are individual differences in the 
choice of cutoff points, such treatment 


must result in a degree of curvature; so 
long as there are individual differences 
in the judgment of means, it is probable 
that cutoff points will vary. From the 
data summarized in Fig. 3, it is apparent 
that there is still some constant and 
variable error in mean judgments even 
after 700 trials. 

It should be noted that the present 
results are at variance both with the 
Burke-Estes (1957) theory and with the 
empirical data supporting it (Estes et al., 
1957). In the latter study, the response 
probability to each of 12 stimulus ele- 
ments was ". . . virtually equal to the 
conditional probability of reinforcement 
[p.236]" The discrepancy between the 
Estes results and those of the present 
study suggests that reinforcement based 
upon direct comparison of elements from 
the two sampling processes (as in the 
present study) leads to the more rapid 
acquisition of an optimal choice strategy. 
Thus, the less-optimal, probability- 
matching strategy reported by Estes 
would be expected ultimately to be re- 
placed by the more-optimal decision 
strategy observed in the present study if 
training were continued. Just such a 
transition has been reported by Edwards 
(1961) for the simple two-choice situation. 

The frequency with which the B meter 
is chosen as greater than A, regardless of 
reading, approaches an asymptote at 300 
trials (Fig. 6). This is also consistent 
with Edward's (1961) findings for the 
simple two-choice situation. It is quite 
apparent, however, that changes in be- 
havior continue well beyond 300 trials 
as illustrated by the cutoff data in Fig. 2. 
Most of the improvement is with regard 
to the location of the B distribution 
cutoff point which, it will be recalled, is 
estimated from the magnitude of A 
values required to effect a choice of the 
B distribution. Since A values are in- 
frequently. larger than B values, and 
hence receive fewer reinforcements when 
presented, it is not surprising that S 
should require more trials to reach 
stability in his B cutoff point, 

It is interesting to note that, if condi- 
tional probabilities are disregarded, the 
proportion of A and B predictions closely 
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match their simple probabilities of occur- 
rence (the last points in Fig. 6—78:22 
vs. 76:24). Such data might lead to the 
erroneous conclusion that S completely 
ignores stimulus readings and does, in 
fact, match the probabilities of reinforce- 
ment associated with the two meters. 
The fallaciousness of this interpretation 
is clearly shown in the empirical cutoff 
data; for S to choose Meter A, Stimulus 
A must, on the average, be above 25.6 
or Stimulus B below 20.6. Were he to 
ignore the readings completely, there 
would be no reliable cutoff points at all, 
and were he to consider the two sets of 
readings as drawn from a common 
population, a single cutoff point would 
appear. Obviously, his predictions are 
influenced both by the magnitude of the 
readings and by the distribution from 
which they are drawn. 

A final point of interest is found in the 
data for judgment of conditional proba- 
bilities (Fig. 5). Median judgments are 
most accurate for probabilities below .50; 
higher probabilities are consistently un- 
derestimated. Improvement over trials 
appears largely in a reduction of this 
underestimation. In spite of the fact 
that B > A occurred much more fre- 
quently than A » B in the prediction 
trials, judgments of the two kinds of 
events show surprisingly similar trends 
over the full range of conditional 
probabilities. 
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FUNCTIONAL STIMULUS LEARNING AS RELATED TO DEGREE 
OF PRACTICE AND MEANINGFULNESS! 
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Functional stimulus learning for paired nonsense syllables, as measured 
by recognition of stimuli paired with specific responses, varied directly 
with both stimulus meaningfulness and degree of S-R practice, with 
scores exceeding S-R anticipations until a common asymptote was 


attained. 


Further, letter identification indicated that more letters of 


the stimulus were utilized as the functional stimulus under high than 
low meaningfulness, suggesting a convergence between the nominal 
and functional stimulus as meaningfulness increases. 


Stimulus learning during paired-associate 
practice has ordinarily been measured by R-S 
recall. In terms of the distinction between 
the nominal and functional stimulus (Under- 
wood, 1963), such measures reflect the amount 
of nominal stimulus learning occurring during 
S-R acquisition. However, as Underwood 
and Schulz (1960) have indicated, with 
nonsense syllables as stimuli, a single element 
may serve as the functional stimulus. For 
such terms, recall of the remaining elements 
may represent an incidentally learned integra- 
tion beyond that required for establishing 
discernible cues. 

R-S recall for nonsense syllables has been 
demonstrated (Jantz & Underwood, 1958) 
to increase with S-R practice until an asymp- 
tote is reached which is considerably below 
the S-R asymptote. Similarly, a monotonic 
relationship has been found (Cassem & 
Kausler, 1962) between stimulus meaningful- 
ness (M) and amount of R-S recall. This 
experiment extended these two variables to 
the functional stimulus, with stimulus learn- 
ing being measured by both a recognition and 
a letter identification procedure. 

Method.—A 4 X 2 mixed design was em- 
ployed, with S-R practice (6, 12, 18, or 30 
trials) as "between" and stimulus M—high 
(H) or low (L)—as “within” variables. There 
were 14 undergraduates in each of the four 
practice groups. For the 6-, 12-, and 18-trial 

groups, S received that number of trials 
whether or not a criterion of one perfect trial 
had been attained. The number of Ss reach- 
ing criterion was 0, 0, and 2 for 6, 12, and 18 
trials, respectively. In order to avoid over- 
learning, Ss in the remaining group received 
1 This study is based on a thesis submitted by the 


d t. Louis University 
t author to the Graduate School, S i 
ao fulfillment for the Master of Arts degree. 


30 trials or trials to criterion, whichever came 
first (8 Ss reached criterion; mean number 
trials = 23.14). The list was mixed, with 
5 HM (mean Archer, 1960, value = 83.5%) 
and 5 LM (mean = 12.8%) stimulus terms. 
For control purposes, two combinations of the 
list were prepared, differing in specific S-R 
pairings, with three different serial orders for 
each list. Intralist similarity was made as 
low as possible. Practice was at a 2:2-sec. 
rate, with a 4-sec. intertrial interval. 

A recognition trial followed S-R practice, 
in which S was presented cards, one at a time, 
containing responses on the left side and 
three alternative stimuli on the right. One of 
the stimuli was correct, the other two being 
stimuli associated with other responses in the 
list (one LM and one HM). The S was in- 
structed to encircle the correct stimulus. For 
letter identification, S was given a mimeo- 
graphed sheet, following the recognition trial, 
containing all S-R pairings, and asked to 
encircle the letters of the stimulus utilized in 
associating the response to that stimulus. 

Results.—Mean recognition Scores, cor- 
rected for guessing, are given in Table 1, to- 
gether with summary S-R data. An analysis 
of variance revealed a significant effect on 
recognition for Trials, F (3, 52) = 10.04, 
b <.001, and Meaningfulness, F (1.52) 
= 27.30, P <.001. Although the Trials X M 
Interaction suggested a trend, with HM scores 
reaching an asymptote relatively early in 
practice and LM later, the effect was not 
significant, F (3, 52) = 1.96, p> 10. As 
would be expected, functional stimulus learn- 
ing progresses more rapidly than S-R learn- 
ing (see Table 1), with the latter requiring 
additional processes. Further, as is to be 
expected, functional stimulus learning, unlike 
R-S recall, reaches the same asymptote as 
S-R acquisition. That is, complete S-R 
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TABLE 1 
SUMMARY DATA FoR STIMULUS RECOGNITION AND S-R ACQUISITION 
Stimulus Recognition S-R Acquisition 
Meaningfulness Meaningfulness 
Tri 
rials HM LM HM LM 
M SD M SD M SD M SD 
6 2.86 1.63 2.00 1.32 .07 1.49 1.21 .89 
12 146 179 2.75 1.86 3.21 1.48 2.64 1.74 
18 446 :79 2.86 1.74 4.28 282 2.64 1.45 
30 168 320 4.25 1.28 4.57 194 4:36 193 
learning is impossible without the prior 


establishment of reliable functional stimulus 
cues. 
Mean letter identification scores (maxi- 
mum = 15) for the 6-, 12-, 18-, and 30-trial 
groups, respectively, were 11.36, 11.64, 12.28, 
and 12.64 for HM and 7.43, 8.57, 8.21, and 
9.57 for LM. An analysis of variance indi- 
cated a pronounced elfect for M, F (1, 52) 
= 46.24, p < .001, but the Trials and Trials 
XM effects fell far short of significance 
(Fs«1). These results suggest that there may 
bean increasing convergence between the nom- 
inal and functional stimulus as M increases. 
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ZYELID CR 


WITH A MASKED CONDITIONING PROCEDURE! 


MARVIN J. HOMZIE? ANp GERALD WEISS 


State Unitersity of Iowa 


60 human Ss were given 50 eyelid conditioning trials while engaged in a 
probability-learning task introduced for the purpose of masking the 


conditioning experiment. 


half of the trials a strong 2.0-psi air 
onset of the 500-cps CS; on the other 


puff was presented alone without th 
these conditions were reversed. Since the average UCS 


group (L^) 


For the high-reinforcement group (H^) on 


puff occurred 500 msec. after the 
half of the trials a weak .33-psi air 
e CS. For the low-reinforcement 


intensity was identical for the 2 groups, their drive levels (D) presum- 


ably were equated. The H' 
than the L' group (p < .005). 
previous 
cedures. 


study which employed more 
It was concluded that habit strength (H) of a defense CR is 


group reached a higher performance level 
These results were in agreement with a 


"standard" conditioning pro- 


a function of the intensity of the UCS. 


In an earlier study reported by Spence, 
Haggard, and Ross (1958) it was found that 


!This study was carried out as part of a project 


concerned with the influence of motivation on perform- 
Contract Nonr-1509 (04), 


ance in learning under fs doni] 
Project NR 154-107 between the State University ¢ 
Iowa and the Office of Naval Research. The project is 


under the direction of Kenneth W. Spence. |. 

The first author was a United States Public Health 
Service Predoctoral Fellow MPM-16,385 during the 
period of this investigation. 

Now at the University of Denver. 


a group of Ss that had a strong UCS on the 
half of the trials that involved pairing of the 
CS-UCS and a weak UCS alone on the other 
half of the trials gave a significantly higher 
number of CRs than a group in which the 
strengths of the UCS on the two types of 
trials were reversed. Because level of drive 
(D) in the two groups was equated this 
finding was interpreted as lending support to 
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a reinforcement-type interpretation that 
habit strength (//) in aversive conditioning is 
a function of the intensity of the UCS. The 
present experiment replicates the above study 
with the conditioning embedded (masked) 
within the context of a probability-learning 
task. The advantage of this procedure is that 
it attenuates the effects of human cognitive or 
set factors (Spence, 1963; Spence, Homzie, & 
Rutledge, 1964). The purpose of this study 
was to determine if this procedure would 
produce data consistent with previously ob- 
tained results. Additional procedural differ- 
ences between the two studies are: the 
absence of a ready signal, 500-cps tone as the 
CS instead of a light, and a shorter intertrial 
interval. 

Method.—A total of 60 men and women 
students from an introductory course in psy- 
chology formed the two experimental groups, 
each of which contained 15 men and 15 
women. The apparatus was identical to that 
of Spence, Homzie, and Rutledge (1964). 
A CR was recorded whenever the Brush 
recording pen showed a deflection of 1 mm. 
or more in the range 150-500 msec. following 
the onset of the CS. 

For the high-reinforcement group (H’) a 
strong puff (2.0 psi) always occurred on a 
conditioning trial, whereas a weak puff (.33 
psi) was presented alone without the CS. 
For the low-reinforcement group (L’) these 
conditions were reversed, the weak puff being 
presented on the conditioning trials and the 
strong puff being presented as an extra 
stimulus. The order of presentation of con- 
ditioning and extrastimulus trials was pre- 
arranged according to an irregular order in 
which the number of each was equalized in 
blocks of four trials. 

The Ss were given a set of instructions to 
the effect that the experiment was concerned 
with the effects of distraction on performance 
in a difficult problem-solving situation. They 
were told that their task was to predict, on 
the onset of a central signal light, which of 
two small lamps located to the left and right 
of the signal light was going to light up and to 
signify their prediction by pressing one of two 
push buttons located on the left or right arm 
of their chair. They were told that distracting 
stimuli in the form of an air puff to their eye 

and a tone would be given between their 
response of pressing a button and the lighting 
up of one of the small lamps. The onset of 
the CS (500-cps tone) came 2 sec. after the 
onset of the signal light and had a duration 
of 2,550 msec. The duration of the signal 
light was 4,550 msec. On conditioning trials 
the onset of the UCS (puff) followed the onset 
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of the CS by 500 msec. and lasted for 50 
msec. On extrastimulus (UCS alone) trials 
the UCS was presented 4,500 msec. after the 
onset of the signal light. Five hundred milli- 
seconds after the termination of the signal 
light one or the other of the small lights came 
on and remained on for 1 sec. The intertrial 
period was at predetermined intervals of 9, 12, 
and 15 sec. averaging 12 sec. for each S. 
Fifty conditioning (i.e., paired CS-UCS) trials 
were given. 

Results and discussion.—Figure 1 presents 
the frequency curves of conditioning for the 
high-reinforcement group (H’) and the low- 
reinforcement group (L'). Also included in 
Fig. 1 are the performance curves from a study 
previously reported by Spence, Haggard, and 
Ross (1958) for Ss conditioned under "'stand- 
ard" conditions, i.e., without the probability- 
learning task. The curves labeled Gp. H and 
Gp. L are the conditioning curves for the 
high- and low-reinforcement groups, re- 
spectively. As may be seen, in both experi- 
ments, the curve for the high-reinforcement 
group (H', H) is well above that for the low- 
reinforcement group (L', L). In agreement 
with the previous findings, evaluation of the 
performance levels over the last 20 condition- 
ing trials by means of the Mann-Whitney U 
test resulted in a significant difference be- 
tween Groups H’ and L’ (p < .005). 

In view of the procedural differences, the 
results of the two experiments are remarkably 
similar. Thus, conditioning embedded in the 
context of the probability-learning task 
produces results consistent with those previ- 
ously established under more standard condi- 
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Gp.L' 
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tions. These two experiments indicate that 
with level of drive (D) equated, performance 
in classical aversive conditioning is a function 
of the intensity of the UCS occurring on the 
reinforced trials. These results may be inter- 
preted as supporting a reinforcement-type 
theory that habit strength (H) in classical 
aversive conditioning is a function of the 
intensity of the UCS occurring on the rein- 
forced trials. 
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OF FREE RECALL FROM WORD-ASSOCIATION 


MEASURES: A REPLICATION 


ARTHUR M. BODIN, LEWIS A. CRAPSI, MARILYN R. DEAK, 
THEOBOLD R. MORDAY, AND LAURENCE D. RUST 


State University of New York at Bufalo 


Mean recall frequency of 99 Kent-Rosanoff words was investigated as a 
function of cue number, a measure of intralist association. Word cards 
were shuffled and viewed for 5 sec. each by 15 female and 13 male under- 
graduate Ss, 17-19 yr. old. Following a 15-min. rest, Ss had 5 min. for 
written recall. Results showed a significant relationship between cue 
number (Nc) and mean recall frequency. Peaks and valleys graphed 


from replication and original 


unique to one word, suggesting t 


not random effects. Cue linkage 


association strength 


data matched and coincided with Ne's 
hese fluctuations reflect particular, 
increased with Nc and depended on 
between cuing and cue-linked words, replicating 


original findings, and, originally unreported, favored early recall. 


'This study was an attempt to replicate 
Rothkopf and Coke's (1961b) investigation of 
the relationship between the free recall of each 
word in a word-association list and cue number 
(Nc). As computed by Rothkopf and Coke 
(1961a), Nc is the number of other words in 
the list which elicited a particular word more 
than 11 times or as one of the 10 most frequent 
associates in the 1952 Minnesota norms of the 
Kent-Rosanoff Word-Association Test (Rus- 
sell & Jenkins, 1954). Thus, Nc quantifies 
the degree of intralist associations, which 
Deese (1959, 1960) showed to be positively 
related to the number of items recalled. 

Method.—The 100-word Kent-Rosanoff 
list was modified by the deletion of CHEESE, 
leaving 99 words and permitting equal divi- 
sion into three sublists for presentation in turn 
to three Ss per sitting. Each word was typed 
in capitals on a separate 3 X Sin. white card. 

Twenty-eight undergraduate students at 
the State University of New York at Buffalo 
served as Ss without pay. These 15 females 
and 13 males ranged from 17 to 19 yr. of age. 

The stimulus card deck was shuffled and 


split into three numerically equal subdecks, 
placed, one before each S, face up, but 
covered with a blank top card. 

Instructions were read aloud as follows: 


This experiment will take about half an 
hour. You will hold a stack of 3 X 5 cards. 
On each card there is a different word, 
except for the card on top, which is blank. 
You inspect each card for a few seconds, 
until I say, "Next"; then turn it over onto 
the table and inspect the next word. Once 
we start, please don't talk till we're finished 
with the whole experiment. I'll tell you 
when that is. Do you have any questions? 
—All right: when I say, "Begin," turn the 
top card over and start. 


Subdecks were inspected at the rate of 5 sec. 
per card, then shuffied separately by E and 
reassigned, until each subdeck had been 
completed by each S, in rotation. A 15-min. 
rest pause was introduced as follows: 


Now please fill this out. [The E handed 
each S a biographical data form.] Then 
take a rest pause in the hall right outside 
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this door till we call you back for the next 
in about 10 minutes; but please do not 
discuss this experiment. 


After the interlude, Ss were given 5 min. to 
recall the stimulus words and were instructed 
as follows: 


When I say, "Begin," write down all the 
words you can remember from the cards 
you looked at earlier. Write each word ona 
separate page. Do not tear off any pages. 
Write as many words as you can. I will 
stop you in 5 minutes. Begin. 


Results and discussion.—Results have been 
plotted in Fig. 1 as mean recall frequency of 
words having the same Nc vs. Ne, beneath 
which, in parentheses, is the number of words 
in the deck having that Ne. It will be noted 
that the peaks and valleys obtained in this 
replication are consistently congruent with 
those obtained by Rothkopf and Coke 
(1961b). And the fluctuations in mean recall 
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frequency, which seemed in the original data 
to reflect mere random variations, now appear 
to be functions of the particular words in- 
volved. This impression is strengthened by 
the fact that the peaks and valleys, with only 
one exception, occur in conjunction with an 
Nc to which only one word contributed data. 
Thus, no opportunity for idiosyncrasies to be 
averaged out was afforded by these instances, 
that is, they precluded possibilities for other 
words to smooth or balance out any recall 
effects peculiar to particular words of a given 
Nc. Restating this evidence, every Nc which 
was unshared, that is to say, unique to one 
word, coincides with either a peak or a valley 
in recall frequency. 

The consistently higher mean recall fre- 
quencies obtained by Rothkopf and Coke 
(1961b) may be a function of a difference in 
set. The original unpublished instructions 
included the statement, "Look at the words 
carefully because after you have seen all the 
words, I am going to ask you to remember as 


i Original Data a} — — [s] 


© Replication Datagy — O 


14 15 16 


1 2 3 1 0 0 


Fic. 1. Mean recall frequencies for words of each cue number, 
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many of them as you can.""! The replication 
instructions, however, made no reference to a 
future recall task. Since Ss in the two experi- 
ments may have differed in relevant ways 
other than the set induced in them, this 
interpretation of a consistent difference in 
results as a function of a difference in instruc- 
tions cannot be advanced as more than a 
possibility, attractive because of its face 
validity. 

The product-moment correlation coefficient 
between recall frequency and Ne was .40 
(p <.001), a result in essential agreement 
with that of the original study (r = .62, 
p «.001). 

An analysis of variance showed that vari- 
ance between Nc was significantly greater 
than within-Nc variance, F (14, 84) = 9.02, 
p «.001. Again, the replication result agrees 
closely with the result of the original study, 
F (14, 84) = 7.81, p < .001. 

Rothkopf and Coke (1961b) suggested that 
the effect of Nc on recall might be accounted 
for by a cuing mechanism having greater 
strength at higher Nc levels. To measure 
such a cuing mechanism, a tally was made of 
cue linkage, i.e., the number of instances in 
which each word was recalled immediately 
after a word for which it was an association 
response. The cue-linkage frequency of each 
word was divided by the corresponding num- 
ber of opportunities for cue linkage, the latter 
being each word's frequency of recall in any 
ordinal position except the first one in a list. 
The ratio of cue linkage to opportunities for 
cue linkage appeared to be an increasing 
function of Nc. The analysis of variance 
relevant to the present data showed that 
variance between Ne was significantly greater 
than within-Ne variance, F (13, 59) = 2.07, 
p «.05. Again, this result confirms the 
original findings, F (13, 59) = 5.60, p < .001. 

To find whether cue linkage depended on 
association strength between cuing and cue- 
linked words, a three-way tally was made, 
according to whether the association fre- 
quency of the latter was greater than, equal 
to, or less than the median association fre- 
quency of the relevant normative associations 
to the cuing words as computed by Rothkopf 
and Coke (1961a). Of 124 cue linkages in 675 
recalled words, 81 were above the median, 
16 were equal to the median, and 27 were 
below the median. The 16 cases at the median 


1E. Z. Rothkopf, personal communication, Feb- 


ruary 26, 1962. 


were then split equally into the above and 
below median categories, and a binomial test 
was performed on the data (Siegel, 1956, pp. 
36-42). The two-tailed test was significant 
(s = 4.76, p <.001). Again, the present 
results support the findings of the original 
study (s = 7.31, p < 001). Rothkopf and 
Coke (1961b) concluded: 


Words which followed one of their cues in 
recall tended to be among those word 
associations which had association fre- 
quencies above the median association fre- 
quency of the association responses (list 
words only) to the cue word [p. 438]. 


One additional analysis, not reported by 
Rothkopf and Coke (1961b) was performed 
in the present study. A count was made of 
the number of cue linkages which occurred in 
the first and second halves of each S's recall 
list, excluding the initial word, since it could 
not manifest cue linkage in the absence of a 
cue. The hypothesis was that more cue link- 
ages would be found among the words written 
in the first than in the second half of the 
recall lists. This proved to be true for 19 of 
the 28 Ss. Of the 9 remaining Ss, 6 produced 
lists having an equal number of cue linkages 
in each half, and only 3 produced lists with 
more cue linkages in the second half. Thus, 
the hypothesis was confirmed by a one-tailed 
sign test which showed that the obtained 
difference in the predicted direction was 
significant (p < .001). 

The summarized conclusion of Rothkopf 
and Coke (1961b) was that ‘These results 
were consistent with a cuing interpretation 
of the frequency and sequence of recall phe- 
nomena which were the subject of this study 
[p. 438]."" Every aspect of the present 
replication supports their conclusion. 
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UNLEARNING AS A FUNCTION OF THE RELATIONSHIP 


BETWEEN SUCCESSIVE RESPONSE CLASSES! 


LEO POSTMAN, GEOFFREY KEPPEL, axp KAREN STARK 
University of California, Berkeley 


This study tests the assumption that in the A-B, A-C transfer paradigm 
the unreinforced evocation of covert or overt first-list responses during 
the acquisition of the second list results in the unlearning of the first-list 
associations. In an RI design using the A-B, A-C paradigm the work 
groups received interpolated trials on a second list in which the type of 
responses either remained the same as in List 1 or was different. Free 
recall of List 1 associations revealed greater RI and more interlist in 
trusions for the conditions using the same type of responses. It appears 
that the degree of unlearning depends upon the frequency with which 
first-list responses are elicited during second-list learning. These re- 
sults are consistent with the position that unlearning is a process sharing 
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some of the functional characteristics of experimental extinction. 


, Evidence for the unlearning of first- 
list associations (A-B) during the 
acquisition of a second list (A-C) has 
been presented by Barnes and Under- 
wood (1959). An unpaced test of 
recall in which Ss were required to 
report the responses from both lists 
showed that the number of first-list 
Tesponses recalled declined steadily 
as a function of the degree of second- 
list learning. Since the conditions of 
recall minimized the opportunities for 
response competition, the progressive 


lThis research was carried out under 
Ontract Nonr 222(90) between the Uni- 
Versity of California and the Office of Naval 
I esearch, This work was done at the 
Nstitute of Human Learning which is sup- 
Ported by a grant from the National Science 
oundation. 


increases in retroactive inhibition 
(RI) were attributed to the gradual 
weakening of the first-list associations 
during interpolated learning. In a 
later experiment, McGovern (1964) 
demonstrated that A-C learning not 
only weakens the A-B or forward asso- 
ciations but also reduces the availabil- 
ity of the first-list responses as such. 
Theoretical interpretations of these 
results have identified the unlearning 
of the verbal associations with the 
process of extinction in classical condi- 
tioning. Thus, it is assumed that 
covert or overt first-list responses are 
evoked during the acquisition of the 
second list and that as a consequence 
of their failure to receive reinforce- 
ment the first-list associations undergo 
extinction. 
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Although the concept of unlearning 
represents an important step in the 
analysis of the mechanisms of transfer 
in verbal learning, there have been 
few investigations of the boundary 
conditions within which this analysis 
applies. Thus, studies investigating 
unlearning in the A-B, A-C paradigm 
have typically used nonsense-syllable 
stimuli and adjective responses in both 
lists. In these studies, and in fact in 
all experiments on unlearning thus 
far, the B and C responses learned 
in the successive lists are from the 
same form class, e.g., adjectives. 
Within the framework of a theoretical 
analysis which identifies unlearning 
with extinction it becomes an im- 
portant question, however, whether 
the amount of unlearning varies with 
the similarity of the form class of the 
successive response terms. 

As stated earlier, first-list responses 
are assumed to undergo extinction as 
a result of their unreinforced evocation 
as implicit or explicit errors during 
second-list learning (cf. Melton, 1961; 
Melton & Irwin, 1940). It is possible, 
however, that unlearning is independ- 
ent of the evocation of the first-list 
response and reflects the displacement 
of an old response by a new one, 
These two interpretations lead to 
different predictions about the effects 
which manipulation of the form class 
of the responses should have on the 
amount of unlearning. It is reason- 
able to suppose that the probability 
of implicit and explicit interlist errors 
varies directly with the similarity of 
the form class of the successive re- 
sponse terms. If the degree of un- 

learning depends on the frequency 
with which first-list responses are 
elicited during the acquisition of the 
second list, the amount of RI should 
then be greater when the form class 
remains the same than when it 
changes. On the other hand, if un- 


learning depends merely on the ac- 
quisition of a new response to an old 
stimulus, the amount of RI should be 
independent of the similarity of form 
class. ` 

It is interesting to note that the 
prediction based on a theory of ex- 
tinction is exactly opposite to that 
implied by Osgood's (1949) analysis 
according to which RI increases as the 
similarity of the response terms de- 
creases. Since responses from the 
same form class are more similar than 
responses from different form classes, 
greater RI should be found under the 
latter condition. 

In the present experiment Ss 
learned two lists in which the stimuli 
(numbers) remained identical. The 
Tesponse terms in the two lists either 
came from the same class (adjective- 
adjective or letter-letter) or from 
different classes (adjective-letter or 
letter-adjective), This arrangement 
permitted a test of the experimental 
hypotheses stated above, and at the 
same time provided an opportunity to 
determine the generality of the un- 


learning phenomenon with new types 
of materials. 


METHOD 


Design.—The design comprised two RI 
work conditions which differed with regard 
to the similarity of the form class of the two 
sets of response terms, and a control condition. 
The work groups learned two lists of paired 
associates (A-B, A-C). One work condition 
(Cond. S) received the same type of response 
terms in the two paired-associate lists and the 
other condition (Cond. D) received different 
types of response terms in the two lists. The 
control groups learned a single list. A second 
variable, type of first-list pairs (number- 
adjective or number-letter), was combined 
factorially with the three conditions, Thus, 


the complete experimental design consisted 
of six different treatm 


f nent combinations, 
Malerials.—Each list consisted of eight 
paired associates in which the stimuli were 


the single-digit numbers 2-9 and the re- 
sponses were either adjectives or letters, The 
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two lists which were presented to the RI 
groups conformed to the A-B, A-C transfer 
paradigm. The 16 response words were two- 
syllable adjectives, chosen for minimal 
similarity from Haagen’s (1949) list of 
adjectives and divided into two sets of com- 
parable Thorndike-Lorge (1944) frequency. 
The 16 letters were selected by eliminating 
from the alphabet the three most frequent 
letters, all vowels, letters corresponding to 
Roman numerals, and letters for which con- 
fusion with other letters may occur. The 16 
letters were divided into two sets which are 
comparable in frequency of occurrence in 
printed text. The actual response units 
consisted of repeated letters, e.g., GG or ZZ. 
This was done to equate the time required 
for the pronunciation of the two-syllable 
adjectives and the letters. For both form 
classes, the two subsets of responses were 
used equally often as first-list responses. 

In Cond. S in which the two sets of response 
terms were from the same form class, the 
similarity between the two responses which 
would be associated with a common stimulus 
was minimized for both the adjectives and 
letters. For the adjectives the pairing was 
random with the exception that similar suf- 
fixes, similar meanings, and initial letters that 
are adjacent in the alphabet were not allowed. 
The pairing of the letters was such that the 
Paired letters did not rhyme, were not 
adjacent in the alphabet, and did not suggest 
abbreviations or words. For Cond. D two 
restrictions were placed upon the pairings of 
adjectives and letters: the letter could not 
Occur in the paired word and the letter could 
not be adjacent in the alphabet to the initial 
letter of the paired word. For word-word, 
letter-letter, and word-letter pairs, two such 
Pairings were employed. The particular 
Stimulus-response pairings were chosen to 
avoid similarities in sound and appearance. 
Finally, four different orderings of the pairs 
Were used to minimize serial learning of the 
responses and to provide four different 
Starting positions. 

For purposes of group designation two 
letters will describe the experimental treat- 
Ment. The first letter of the designation (A 
or L) will refer to the type of response term, 
adjective or letter, during first-list learning. 

he second letter will be assigned on the same 

asis to indicate the type of response term 
uring second-list learning, with the letter 
X (rest) used to designate the control condi- 
tion. Thus, Groups A-A and L-L comprise 
Cond. S, and Groups A-L and L-A comprise 
Cond. p. Groups A-R and L-R are the 
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control groups learning adjectives and letters, 
respectively. f 

Procedure.—Both lists were learned by the 
anticipation method at a 2:2-sec. rate; the 
intertrial interval was 4sec. The first list was 
presented to a criterion of one perfect recita- 
tion. Approximately 1 min. following the 
attainment of the criterion all Ss were given 
the instructions for the interpolated activity. 
For the RI groups this activity consisted of a 
second list of paired associates which was 
presented for 20 anticipation trials, while for 
the Control groups a series of arithmetic 
problems was administered. At the end of 
the interpolated activity (18 min. following 
the end of first-list learning) both groups 
were given a modifed free-recall test (MFR). 
The MFR test consisted of an unpaced 
presentation of each of the eight stimulus 
terms; to the right of each stimulus item was 
a dash indicating the missing first-list response 
and the appropriate second-list response. It 
was made clear to S that the second-list 
response would be provided in each case. The 
S was given unlimited time to give the correct 
first-list response for each stimulus, the drum 
being advanced whenever a response was 
given or whenever S indicated an inability to 
recall. The order of the stimulus terms on 
the MFR test corresponded to the order of the 
pairs on the trial following the first-list 
criterion trial. 

It should be noted that the present ex- 
periment represents a departure from the 
procedure employed by Barnes and Under- 
wood (1959) to study the unlearning of first- 
list associations. These investigators re- 
quired Ss to write down the response terms 
from both paired-associate lists. At the 
higher degrees of second-list learning there 
was a tendency for the Ss to give the second- 
list responses first. This order of recall has 
two possible consequences: it provides an 
additional unlearning trial and effectively 
lengthens the retention interval for the 
first-list responses. In the present experiment 
the effect of this "interpolated recall" was 
eliminated by limiting the test of retention to 
the first-list responses and by providing the 
correctly paired second-list responses at the 
time of recall. 

Subjects.—A total of 96 Ss, 16 in each of 
the six conditions, participated in the study. 
The Ss were undergraduate psychology 
students at the University of California. The 
experimental conditions were randomized into 
16 blocks of 6, each condition appearing once 
in each block. The particular list order, list 
pairing, and starting order were assigned 
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TABLE 1 
LEARNING AND RECALL MEASURES FOR THE VARIOUS GROUPS 


List 1 Form Class 


Letters Adjectives 
Group L-L | Group L-A | Group L-R | Group A-A | Group A-L | Group A-R 

i E jals te 
Bo cw an 13.8* 14.3 11.1 10.9 10.4 11.4 
List2: M numbers 

e CorrüctiPeSponSe 120.65 131.4 = 139.1 b 126.8 — 
MFR: Stringent 4.75¢ 7.06 7.69 4.75 6.38 7.56 
MFR: Lenient 5.192 7.44 7.81 4.81 6.44 7.56 


a MSy = 34.17. 


randomly to the 16 blocks with the restriction 
that these assignments be balanced over the 
96 Ss. The Ss were assigned to the ran- 
domized order of the conditions in order of 
their appearance in the laboratory. There 
were no restrictions placed on the selection 
of Ss, most of whom had participated in 
previous verbal-learning experiments. No S 
was eliminated for failure to reach the first- 
list criterion. 


RESULTS 


First-list learning —The mean num- 
bers of trials to a criterion of one 
perfect recitation on List 1 may be 
found in the first row of Table 1. 
In this table the six groups are 
represented by the column entries, 
the first three columns consisting of 
groups receiving letters as first-list 
response terms, the second three 
columns consisting of groups receiving 
adjectives as first-list response terms. 
Since the variances were homogeneous 
for all measures, the error mean square 
is given at the bottom of the table. 
The two work groups learning letter 
responses reached criterion somewhat 
more slowly than the remaining 
groups. Analysis of variance shows, 
however, that the variation in speed 
of List 1 learning is not significant 
(F = 1.22), nor is there a reliable 
overall difference between the two 


types of list, (1,90) = 3.17, p > .05. 
It should be noted that the average 
speed of learning is virtually identical 
for the two groups comprising Cond. S 
(A-A and L-L) and Cond. D (A-L and 
L-A), respectively. 

Second-list learning.—lhe mean 
numbers of correct responses (out of 
a possible 160) during the 20 anticipa- 
tion trials on List 2 are shown in the 
second row of Table 1. Adjective re- 
sponses again have an advantage over 
letter responses, and this difference 
reaches statistical significance in List 
2 learning, F (1, 60) = 5.31, p < .025. 
The degree of List 2 learning for 
Cond. S and D is almost exactly the 
same (F « 1). The interaction of 
type of list (A vs. L) with the condi- 
tion of interpolation (S vs. D) is not 
significant (F — 1.92). It is to be 
noted, however, that when the List 2 
responses are adjectives, Cond. S 
surpasses Cond. D (A-A > L-A) 
whereas the opposite is true when the 
List 2 responses are letters (A-L 
> L-L). As Fig. 1 shows, this inter- 
action is clearly present on the early 
trials of List 2 learning, and it is 
statistically significant when the first 
two interpolated trials are considered, 
F (1,60) = 13.00, p < .01. This find- 
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ing suggests that identity of response 
class may have two opposed effects on 
the speed of List 2 learning: (a) the 
set to give the appropriate class of 
responses facilitates performance; (b) 
failures to discriminate the list mem- 
bership of specific items within the 
common response class is conducive 
to interlist interference. The more 
limited the pool of items in the com- 
mon class the more likely it is that 
such interlist interferences will out- 
weigh the facilitative effects of re- 
sponse restriction. The total pool of 
letters is, of course, considerably more 
limited than that of adjectives. 
Hence the net difference between 
Cond. S and D is positive for adjec- 
tives and negative for letters, es- 
pecially on the early trials of List 2 
learning when response restriction and 
list differentiation should have their 
greatest effects. This interpretation 
is supported by an examination of the 
interlist intrusions in List 2 learning. 
Interlist intrusions during interpo- 
lated learning.—Neither of the groups 
in Cond. D gave any interlist intru- 
sions during List 2 learning. There 
were two intrusions contributed by 
two Ss in Group A-A and six intru- 
sions contributed by four Ss in Group 
L-L. Although the absolute fre- 
quencies of intrusions are as usual 
very small, the difference between the 
groups is in accord with the assump- 
tion that interlist differentiation in- 
Creases with the size of the response 
Pool. All but one of these intrusions 
Occurred during the first six trials of 
Interpolated learning. 
Recall.—The last two rows of Table 
1 show the mean numbers of correct 
List 1 responses on the MFR test. A 
stringent” score and a “lenient” 
Score are presented in each case. The 
stringent scores are based on responses 
given to the appropriate stimulus. 
he lenient scores are based on all 


Mean number correct 
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Fic. 1. Mean numbers of correct responses 
in interpolated learning. (There was little 
change in performance between Trials 11 
and 19.) 


List 1 responses recalled, regardless of 
whether or not they were given to the 
appropriate stimulus. 

The stringent scores will be consid- 
ered first. The two control groups, 
A-R and L-R, have very similar scores 
and show only a small amount of for- 
getting. Each of the four work 
groups recalls fewer List 1 responses 
than its corresponding control group. 
The overall difference between the 
control and work treatments is highly 
significant, F (1,90) = 42.37,p < 001. 
Thus, there is clear evidence for un- 
learning of first-list associations. The 
amount of unlearning, as measured by 
the control-work differences, does not 
vary reliably with the type of response 
term (F < 1). For both types of list 
the retention losses are, however, 
substantially greater in Cond. S (A-A 
and L-L) than in Cond. D (A-L and 
L-A), F (1, 90) for the difference be- 
tween the two conditions of interpola- 
tion is 34.45 (p « .001). Again this 
difference does not vary with type of 
list (F « 1). 

With one exception (A-R), the 
lenient scores of all groups are higher 
than the stringent scores. These in- 
creases reflect misplaced responses 
which were not also given to the 
appropriate stimulus terms. For each 
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TABLE 2 
INTRALIST ERRORS AND INTERLIST INTRUSIONS IN MFR 
List 1 Form Class 
Letters Adjectives 
Group L-L Group L-A Group L-R Group A-A Group A-L Group A-R 
f n f n f n f n f n n 
Intralist | 15 8 10 7 4 2 5 4 4 2 2 2 
Interlist | 13 8 0 0 — — 2 2 0 0 E mz 


Note.—f = total frequency of errors; n = number of Ss contributing errors, 


treatment, the group learning letter 
responses shows a greater gain than 
the group learning adjective responses. 
This differential gain results in a 
significant overall difference between 
the two types of lists in the amount 
recalled as measured by the lenient 
scores, F (1, 90) = 4.92, p < .05. In 
all other respects the pattern of 
differences, and notably the separa- 
tion between Cond. S and D, are the 
same for the lenient and the stringent 
scores, and are associated with the 
same levels of statistical significance. 

Errors in M FR.—Since the stimulus 
terms accompanied by the appropriate 
List 2 responses were exposed one by 
one in MFR, it was possible for S to 
make both intralist errors (misplaced 
responses from within List 1) and 
interlist errors (inappropriately paired 
List 2 responses). The total fre- 
quencies of such errors and the num- 
bers of Ss contributing each are shown 
in Table 2. It is clear that intralist 
errors occur more frequently in the 
recall of letters than of adjectives. 
This finding is brought out most 
clearly in the comparison between 
Groups A-A and L-L since the num- 
bers of correct responses and hence 
the opportunities for errors were 
identical in these two cases. The 
difference between Groups A-L and 
L-A is in the same direction and re- 


mains intact when opportunities for 
errors are taken into account. Inter- 
list errors at recall occur in Cond. S 
but not in Cond. D and again are 
substantially more numerous in the 
recall of letters than of adjectives. 


Discussion 


The results of the experiment show 
significantly greater unlearning of the 
first list in an A-B, A-C paradigm when 
the class of responses in the successive 
lists remains the same than when it 
changes. The difference between Cond. 
S and D does not interact with the type 
of list, i.e., it is present to a comparable 
degree in the recall of both letters and 
adjectives. These findings are in accord 
with the hypothesis that the amount of 
unlearning depends on the frequency 
with which List 1 responses are elicited 
as covert or overt errors during List 2 
learning. Thus, the interpretation of 
unlearning as a process sharing some of 
the functional characteristics of ex- 
tinction appears to receive additional 
support. 

The validity of this conclusion de- 
pends, of course, on the correctness of the 
assumption that the probability of un- 
reinforced elicitations of List 1 responses 
depends on the similarity of form class. 
There is substantial evidence for the 
operation of a selector mechanism in 
learning and recall (Underwood & Schulz, 
1960, pp. 143 ff.), i.e., for S's tendency 
to restrict his responses to the categories 
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of items represented in a list. It is 
reasonable to suppose, therefore, that 
during List 1 learning S's range of re- 
sponses becomes restricted to the class of 
items appropriate to that list. If the 
form class remains the same in List 2, S's 
responses will continue to be drawn 
from the same restricted range; to the 
extent that there are failures of differ- 
entiation among items within the appro- 
priate class, List 1 responses will occur 
as implicit or explicit errors. A change 
in form class leads to a new basis of 
response selection during acquisition of 
List 2, and hence to a reduction in the 
probability of interlist errors. The 
difference between Cond. S and D in the 
frequency of overt intrusions during 
List 2 learning conforms to these expecta- 
tions. The fact that there is some un- 
learning under Cond. D indicates, how- 
ever, that there are limitations on the 
effectiveness of the selector mechanism 
in protecting List 1 responses from being 
elicited as errors and undergoing ex- 
tinction. It is possible, of course, that 
extinction produced by unreinforced 
evocation of List 1 responses is not the 
only factor contributing to failures of 
List 1recall. For example, performance 
in MFR may not be completely free from 
the effects of response competition, i.e., 
dominant List 2 responses may some- 
times interfere with List 1 recall even 
Without prior extinction. 

Although the variations in the amount 
of unlearning are parallel for the two 
types of list, it is clear that interitem 
differentiation is more difficult for letters 
than for adjectives. Group L-L gave 
more interlist intrusions than Group A-A 
both during List 2 learning and in MFR. 
In addition, the numbers of misplaced 

Ist 1 responses were greater in the 
recall of letters than of adjectives. This 
difference has been attributed to the fact 
that the total pool of letters is consider- 
ably smaller than that of adjectives. 
ta experimental associations are likely 
© be strong among the members of a 
Strictly circumscribed class of responses. 

‘hus, letters are used continuously in 
ene combinations and there are 
rong sequential dependencies among 
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them. As a result, the members of this 
class will tend to elicit each other in 
learning and recall. There are no com- 
parable preexisting associations among 
randomly selected members of the broad 
class of adjectives. Moreover, the 
smaller the pool of items the more readily 
available all possible responses are to S. 
The tendency to guess may be expected 
to increase with the availability of the 
total range of responses. Guessing will, 
of course, increase the frequency of overt 
errors. 

There is no systematic relationship be- 
tween amount of unlearning and error 
rate in MFR. Although the numbers of 
misplaced responses and interlist errors 
are higher in the recall of letters than 
of adjectives, the stringent scores for 
Groups L-L and A-A are identical and 
Group L-A has an advantage over Group 
A-L. Itappears that overt errors occur 
when the correct responses are not avail- 
able, and that they do so at different 
rates depending on the class of responses. 
If this interpretation is correct, mis- 
placed responses and interlist intrusions 
in MFR are a consequence rather than 
a determinant of failures to recall the 
responses to specific stimuli (cf. Conrad, 
1960). 

A high rate of intralist errors in MFR 
will also serve to enhance the differences 
between the stringent and lenient scores. 
Thus, while the stringent scores for the 
two types of list do not differ signifi- 
cantly, the lenient scores are reliably 
higher in the recall of letters than of 
adjectives. For purposes of interpreting 
this pattern of differences it may be 
assumed that (a) the lenient scores 
measure the strength of the association 
of the responses to the environmental 
context, whereas (b) the stringent scores 
reflect in addition to association with 
context the strength of specific S-R con- 
nections (cf. McGovern, 1964; Under- 
wood, Keppel, & Schulz, 1962). It 
follows that the rate of unlearning of 
specific S-R associations is independent 
of the type of response whereas the 
association to contextual stimuli is more 
resistant to extinction for letters than 
for adjectives. This interpretation must 
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be viewed with caution, however, since 
the number of letter responses may well 
be inflated by guessing. In any event, 
the comparability of the stringent scores 
for the two classes of responses extends 
the generality of the unlearning phe- 
nomenon to include both letter and 
adjective responses to numbers. 

The results of the experiment have 
general implications for the role of 
response similarity in transfer and inter- 
ference. Reference has already been 
made to Osgood's (1949) analysis accord- 
ing to which negative transfer and RI are 
expected to increase as successive re- 
sponses to the same stimuli become more 
dissimilar. This relationship does, in- 
deed, hold true when degree of response 
similarity is varied within the same 
response class. Thus, with responses 
drawn from the same class, A-B, A-C 
(identical stimuli and unrelated re- 
Sponses) is a paradigm of negative 
transfer, and A-B, A-B' (identical stimuli 
and similar responses) is a paradigm of 
positive transfer. The present findings 
indicate that this relationship does not 
apply to the similarity of response 
classes and instead suggest a new prin- 
ciple: the more dissimilar the classes of 
responses attached to the same stimuli 
the more likely it becomes that the 
successive response systems will maintain 
their strengths independently of each 
other. 
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A paired-associate learning task involved functions which related 


numerical responses to numerical stimuli, 
The results showed that Ss used the 


each of 17 different functions. 


Some order characterized 


order incorporated in a function to reduce their error in responding to 


stimuli encountered for the first time. 


For all different functions, error 


in initial responses to new stimuli decreased as the number of experienced 


S-R pairings increased. 


In paired-associate tasks character- 
ized by orderly relations between the 
stimulus and response variables, it is 
possible that Ss can use the order in 
earlier experienced S-R pairings to aid 
them in responding to new stimuli. 
The purpose of the present experiment 
is to determine whether Ss use such 
order to decrease error in responding 
to a stimulus which has not yet been 
experienced. 

To illustrate, suppose that an S 
has experienced the following four 
S-R pairings: 1-11, 2-12, 3-13, and 
4-14. The S would very likely make 
the appropriate response, 15, to the 
Stimulus 5 even though he has never 

efore experienced it. Although the 
Order in this illustration is simple to 
the point of being trivial, the possi- 
bility exists that Ss can use more 
Subtle types of S-R order to reduce 
error in responding to new stimuli. 
The purpose of this experiment is to 
See if error in initial responses to new 
Stimuli decreases as the number of 
experienced S-R pairings increases, 


1 The research reported here was under- 
taken in the Behavior Research Laboratory, 
Nstitute of Behavioral Science, University of 
Olorado, and is Publication No. 40 of the 
Nstitute. This research was supported by a 
research grant (M-4977) from the National 
Institute of Mental Health. Grateful ac- 
*nowledgment is made by the authors to 

: R. Hammond and F. J. Todd for valuable 
Suggestions, 
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when responses are related to stimuli 
by some orderly function. Such im- 
provement would indicate a type of 
generalization or transfer of training 
which is mediated by abstracting the 
order in the function relating re- 
sponses to stimuli. 


METHOD 


Subjects:—One hundred thirty-five stu- 
dents from an introductory psychology course 
at the University of Colorado served as Ss. 

Stimuli, responses, and functions.—The 
ordered set of integers 11-36 served as the 
stimulus variable and the set of integers 
37-99 served as the response variable. 
Seventeen different functions paired a re- 
sponse with each of the stimuli. Nine of the 
functions were sampled from mathematics 
texts and included such functions as a straight 
line, a sine curve, an exponential curve, and 
a normal curve. Since only integers were 
used, discrete approximations to the theo- 
retically continuous functions were employed. 

The remaining functions were created to 
incorporate varying degrees of order. They 
are presented in Table 1. The left-hand 
column lists stimulus values; each of the other 
columns indicates the associated response 
value for one of the functions. All functions 
embody some order. The type of order be- 
comes intuitively apparent when they are 
graphed. 

Procedure.—Each S was assigned to a 
single function. The stimulus and response 
values were placed on a long strip of paper 
to be run through a typewriter. The S was 
presented with a stimulus number, typed his 
guess of the response number, and advanced 
the paper two spaces which allowed him to 
see the correct response number. Advancing 
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TABLE 1 


S-R VALUES FOR EACH FUNCTION 


Functions 


the paper one more space brought the next 
number into view. The typewriter was 
equipped with a shield which prevented S 
from seeing more than one previous trial. 

The stimulus sequence, identical for all 
functions and all Ss, was generated by a ran- 
dom process. The S was required to work 
until (a) he completed all of the available 
598 trials, or (b) the 2 hr. available for the 
experiment elapsed, or (c) he completed a 
series of 30 consecutive correct responses. 
Data analyses assume that those Ss who made 
30 consecutive correct responses would have 
responded correctly to all the remaining 
stimuli. 

Data analysis.—Six to eight Ss were em- 
ployed for each function. Seven Ss were 
discarded for failure to progress far enough 
through the trials to observe each of the 26 
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different S-R pairings at least once. 
left 128 Ss for the analysis. 

If Ss capitalize on S-R order, then error 
associated with responses to new stimuli will 
decrease as the number of experienced S-R 
Pairings increases. Thus, if Ss use order to 
extrapolate or generalize from experienced 
S-R pairings to novel pairings, the slope of the 
least-square straight line relating average 
error over Ss for the first occurrence of each 
stimulus to the ordinal position of the first 
occurrence will be negative. This slope was 
obtained for each of the 17 functions. 


This 


RESULTS AND CONCLUSION 


The mean slope of the least-square 
line relating average error for the first 
Occurrence of each stimulus to the 
ordinal position of the first occurrence 
was —.308; the slope for each of the 
17 functions was negative; and the 
95% confidence interval for the mean 
slope was bounded by —.206 and 
—.410. Thus average response error 
to new stimuli decreased about .3 unit 
for each additional S-R pairing ex- 
perienced. Despite their wide variety, 
some decrease characterized all func- 
tions. The conclusion is that Ss do 
use order embodied in S-R relations 
to aid them in responding to new 
stimuli. 

Additional analyses measured the 
relations between (a) degree of order 
characterizing the functions and (b) 
paired-associate performance on the 
functions. None of the measures of 
4 predicted b very well.? Yet, since 
Ss do use order in S-R relations, the 
quantification of such order remains 
an important, but unsolved, problem. 

? These analyses are contained in Behavior 


Research. Laboratory Report No. 39, Institute 
of Behavioral Science, University of Colorado. 
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RATE OF PRESENTATION IN SERIAL LEARNING 
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Serial-learning performance was studied under 2 rates of presentation, 


2 and 4 sec. 


Following the attainment of criterion (5/14 and 10/14) 


presentation rate was either switched to the other rate or not switched. 
A final condition consisted of an alternation of 2- and 4-sec. trials, Al- 
though criterion was reached more rapidly under the 4-sec. rate, the 


total learning time required did not 


form 
of presentation on the postcriterion 
having limited influence over the pos 


Bugelski (1962) varied the study 
interval in a paired-associate task 
Írom 2 to 15 sec. and found a direct 
relationship between learning rate 
and length of the study interval. 
However, total learning time was 
constant (Rate X Trials-to-Criterion) 
for the various groups. Although 
there have been no studies reported 
for serial learning where rate of 
presentation has been varied widely, 
there is some indication that a similar 
constancy holds over a restricted 
range of rates (e.g., Hovland, 1938; 
Melton & Stone, 1942). Moreover, 
there appears to be a limit to this con- 
stancy, Braun and Heymann (1958) 
reporting the invariance for high- 
Meaningful paralogs, but not for low 
Paralogs. 

The purpose of the present experi- 
Ment was to determine whether the 
attainment of a common criterion 
under different rates of serial presenta- 
tion reflects equal degrees of learning. 
In the traditional method of serial- 
anticipation learning, the study and 
testing intervals are unavoidably con- 
founded, variation in presentation 
rate resulting in the variation of both 
Of these intervals. Under these cir- 
cumstances equal performance may 
not reflect equal learning if the re- 
Sponses recalled represent differences 
in latency. That is, if some of the 
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differ between the 2 rates. Per- 


ance on the postcriterion trials was primarily a function of the rate 


trials with the precriterion rates 
tcriterion trials. 


items which were correct at the slower 
rate represent latencies which are 
longer than the length of the interval 
for the faster rate, these items may be 
said to reflect less strength than an 
item which is anticipated within the 
shorter interval. To the extent that 
performance at a slower rate consists 
of longer latency responses, attain- 
ment of the same criterion does not 
insure equal degrees of learning. If 
this is the case, conclusions concerning 
the constancy of total learning time 
under different rates must be rejected 
for serial learning. 

In this study Ss learning under two 
rates of presentation, 2 and 4 sec., 
were taken to the same criterion level. 
Following the attainment of criterion, 
presentation rate was either switched 
to the other rate or not switched. If 
performance on the criterion trial 
reflects equal degrees of learning for 
the two rates of presentation, the two 
switched groups should adjust im- 
mediately to the postcriterion rate. 
On the other hand, if performance on 
the criterion trial reflects differences 
in responses latencies for the two 
rates of presentation, the slow-fast 
group should be inferior to the fast 
nonswitched group and the fast-slow 
group should be superior to the slow 
nonswitched group on the postcri- 
terion trials. 
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METHOD 


Design—Two variables, presentation rate 
(2 and 4 sec.) and postcriterion condition 
(switched and nonswitched), were combined 
factorially at two different preswitch criteria 
(5/14 and 10/14). Independent groups of Ss 
were employed for all switched conditions, 
while the four nonswitched conditions were 
represented by two groups both taken to the 
higher criterion, one for each of the two rates. 
A final condition consisted of a group receiving 
alternated fast-slow trials throughout the 
course of learning. 

Procedure.—Precriterion training was given 
under either a 2- or 4-sec. rate of presenta- 
tion. In the designation of conditions, the 
letters F and S will refer to the 2- and 4-sec. 
rates, the first and second letters to the pre- 
and postcriterion rates, and the subscripts 
L and H to the low and high criteria, re- 
spectively. Following the attainment of 
the criterion, the two nonswitched groups 

(Groups F-F and S-S) continued training at 
the precriterion rates and the two switched 
groups (Groups F-S and S-F) continued 
training at the reversed, postcriterion rates. 
The alternation group (Group Alt.) received 
the study trial at the 2-sec. rate, followed by 
reversal in rate (4, 2, 4, etc.) at the end of each 
anticipation trial. Postcriterion training 
consisted of an additional 10 trials at the 
postcriterion rate. For the nonswitched 
groups and Group Alt., the 10 transfer trials 
were administered following the attainment 
of the higher criterion. Thus, it was possible 
to employ these as comparison groups for the 
switched conditions at both criteria. 

Standard serial anticipation learning was 
employed. The Ss were not notified of the 
rate change, but were told to expect either a 
slowing down or a speeding up of the turning 
drum and that the rate change would be made 
during the intertrial interval. In addition, 
Ss were asked to give their responses as 
quickly as possible at all times since they 
would receive no warning of the change in the 
rate of presentation. 

Materials—A single 14-adjective serial 
list, having high intralist similarity, was 
selected from the lists reported by Under- 
wood and Goad (1951). An asterisk served 
as the anticipatory cue and the intertrial 
interval was either 2 or 4 sec. Since interest 
was not centered upon a precise determination 
of the serial-position curve, a single ordering 
of the adjectives was utilized. 

Subjects Students enrolled in the intro- 
ductory psychology classes at Northwestern 
University, for which service in experiments 
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is a course requirement, served as Ss in the 
experiment. Most of the Ss had participated 
in previous paired-associate and serial experi- 
ments. The experimental conditions were 
randomized in blocks of seven and Ss were 
assigned these conditions in the order of their 
appearance in the laboratory. Each group 
contained 15 Ss. No Ss were eliminated for 
failure to reach criterion. 


RESULTS 
Precriterion Performance 


Overall comparisons.—Except for 
different presentation rates all groups 
were treated alike on the precriterion 
trials. An analysis of the Rate 
X Switching factorial at each criterion 
provides a test of the rate variable and 
comparability of groups. At the 
lower criterion the only significant 
source of variability was presentation 
rate, F (1, 56) = 10.41, p < .01, the 
slower rate resulting in faster learning 
(X — 3.17) than the faster rate 
(X — 5.63). The same results were 
found at the higher criterion, F (1, 56) 
= 36.02, p < .01, the mean trials to 
criterion for the F and S groups being 
15.17 and 7.87, respectively. How- 
ever, this latter analysis also revealed 
a significant interaction between the 
two variables, F (1,56) = 4.69, p <.05, 
in which Group F-F reached criterion 
significantly faster than Group F-S, 
t (56) = 2.37, p< .05. Since the 
conditions were randomly assigned to 
Ss, the difference between the two 
groups must be attributed to sampling 
error. Comparison of the F groups 
at the lower criterion indicates that 
Group F-F is deviant, requiring 4.7 
trials to reach criterion while Groups 
F-Sz and F-Sy required 6.6 and 7.2, 
respectively. The influence of Group 
F-F will be minimized in the compari- 
sons which follow by combining the 
various F groups in precriterion com- 
parisons. Moreover, since none of the 
correlations between the criterion 
measure and postcriterion measures 
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are significant, the precriterion differ- 
ence appears to be relatively unim- 
portant in interpreting postcriterion 
data. 

Learning-time consta ncy.—Although 
performance under the two rates of 
presentation required different num- 
bers of trials to reach criterion, an 
inspection of means reveals a con- 
stancy in total learning time. Since 
in terms of total presentation time 
two 2-sec. trials are equivalent to one 
4-sec. trial, the scores for the fast rate 
were divided by two and compared 
with those for the slow rate. At both 
criteria the corrected criterion scores 
for the combined groups did not differ 
between the two rates (F = .23 and 
.06). 


In these comparisons and the ones 


reported by others (e.g. Braun & 
Heymann, 1958; Bugelski, 1962), 
learning-time constancy has been 


demonstrated only with the trials-to- 
criterion measure. At the start of 
Trial 1, Ss in the S groups had studied 
each item for 4 sec. on the preceding 
study trial, while Ss in the F groups 
had studied each item for 2sec. How- 
ever, total study time is equivalent at 
the start of Trial 2 since the F groups 
received an additional 2-sec. study on 
Trial 1. Thus, performance on Trial 
1 for the S groups and on Trial 2 for 
the F groups allows a comparison of 
the two groups with total presentation 
time held constant. Other points of 
comparison are possible between con- 
secutive S trials and even-numbered 
F trials. For the present experiment 
performance was compared over the 
first four comparison trials. The total 
number of correct responses was 
slightly higher for the S groups 
(X = 18.31 and 17.97), but this differ- 
ence was not significant 
No consistent differences between the 
two rates over the four trials were 
evident (F = .15). In short, when 
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Fic. 1. The mean number of correct re- 
sponses for the 10 transfer trials for the 
various groups following the attainment of 
the low and high criteria. (The group 
designations indicate the pre- and post- 
criterion presentation rates. Group A was 
administered the S rate on the even-numbered 
trials and the F rate on the odd-numbered 


trials.) 


comparisons of performance under 
different rates of presentation are 
arranged so that total study time is 
equivalent, no differences are observed 
in the number of responses correctly 
anticipated on these trials as a func- 
tion of presentation rate. 


Postcriterion Performance 


Postcriterion trials Following the 
attainment of either criterion, Ss were 
given an additional 10 transfer trials 
at either the same or different pres- 
entation rates. The mean numbers of 
correct responses on the 10 transfer 
trials for the various groups are 
presented in Fig. 1. The triangles, 
unconnected for visual purposes, are 
for Group Alt. and will not enter into 
the main comparisons to be reported. 
Looking first at performance following 
the lower criterion, it appears that all 
groups except Group S-F are initially 
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equal in performance and that with 
additional trials Group F-F separates 
from the two S groups to merge with 
Group S-F. An analysis of the four 
groups in terms of the total number of 
correct responses summed across the 
10 trials showed only the postcriterion 
rate to be significant, F (1,56) = 30.76, 
p «.01. An analysis over trials re- 
vealed a significant interaction with 
postcriterion rate, F (9, 504) = 4.95, 
p < .01, indicating a reliable differ- 
ence in learning curves for the post- 
criterion rates. Other than Trials, 
none of the remaining comparisons 
was significant. Even though none of 
the comparisons including the pre- 
criterion rate was significant, Groups 
F-F and S-F were compared to deter- 
mine the reliability of the differences 
reflected in Fig. 1. Neither a summa- 
tion over trials (X = 77.60 and 63.00) 
nor the interaction of groups and 
trials was significant, F (1, 28) = .53 
and F (9,252) — 1.36, respectively, 
p.05. 

The performance following the 
higher criterion is similar to that ob- 
tained following the lower criterion, 
with Group S-F again falling consis- 
tently below Group F-F and the S 
groups presenting indistinguishable 
learning curves. An analysis of the 
total number of correct responses re- 
vealed only the postcriterion rate to 
be significant, F (1, 56) = 34.59, 
p < .01; none of the interactions with 
trials was significant. A separate 
analysis of Groups F-F and S-F pro- 
duced no significant differences for the 
comparisons of interest. In short, the 
analyses of the transfer trials after 
either criterion indicate that the only 
significant source of variability on the 
postcriterion trials is the rate which is 
being presented at that time. 

Comparisons with Group Alt.—The 

performance of Group Alt. on the 
trials following the attainment of the 


GEOFFREY KEPPEL AND ROBERT J. REHULA 


two criteria is indicated in Fig. 1 by 
means of the unconnected triangles. 
Since in this condition presentation 
rate was shifted on each trial, some of 
the Ss reached the criteria on the slow 
trial and some on the fast trial. Of 
the 15 Ss in this group, 13 reached the 
criterion of 5/14 and 12 reached the 
criterion of 10/14 on the S trial. In 
order to give a clear picture of the 
performance of Group Alt. on the 
postcriterion trials, the data of the Ss 
reaching criterion on the slow trial 
have been grouped and plotted in 
Fig. 1. Thus, on the odd-numbered 
transfer trials the list was presented 
at the F rate and on the even-num- 
bered trials the drum turned at the S 
rate. The most salient aspect of the 
performance of Group Alt. is the fact 
that performance appears to be en- 
tirely determined by the rate of pres- 
entation on any given trial. To test 
this observation a comparison was 
made between Group Alt. and the 
combined switched and nonswitched 
groups on the trials for which the rate 
of presentation was the same. The 
results of these analyses were uni- 
formly negative, the largest F (1, 40) 
for the total number of correct re- 
sponses summed over the five com- 
parison trials being 1.13, p > .05, and 
the largest F (4, 156) for the compari- 
son of the interaction of the combined 
groups with Group Alt. over the five 
trials being 2.15, > .05. It appears, 
then, that the performance of Group 
Alt. is entirely determined by the 
presentation rate given on any trial. 


Discussion 


The results of the present experiment 
corroborate the findings of others that 
rate of presentation in serial learning is 
an effective variable when measured in 
terms of trials to reach criterion, but 
does not produce differences in learning 
when measured in terms of total learning 
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time. Wider variations in rate and in the 
sampling of learning materials, however, 
will be necessary to determine the limits 
of this learning-time constancy for serial 
learning. In this regard, it is interesting 
to note that although presentation rate 
was not varied, a similar constancy was 
obtained by Waugh (1962) for different 
lengths of serial lists. 

It was argued earlier that if the attain- 
ment of a common criterion implies equal 
degrees of learning under the two rates, 
a relatively fast and complete switchover 
to the postcriterion rate should occur. 
This is what appears to have happened: 
postcriterion performance was a function 
of the rate which was operating at that 
time. This conclusion obtains support 
from the postcriterion performance of 
Group Alt. where, after both criteria 
and for both rates of presentation, 
performance was perfectly appropriate 
to the current presentation rate. 


on 
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The present research falls in the 
tradition set by Atkinson and Suppes 
(1958), Suppes and Atkinson (1960), 
Burke (1959, 1960), Estes (1962), and 
Hall (1962) in that it involves the 
analysis of small group behavior in 
terms of extensions of concepts and 
procedures from statistical learning 
theory. 

Each group in this experiment con- 
tained three Ss, and, on each of a 
series of trials, the group made a 
decision by the scheme which Smoke 
and Zajonc (1962) refer to as "dicta- 
torship.” But the individual who 
determined the response for the group 
on a given trial ("dictator") was 
chosen for that role by a majority vote 
of all members of the group. We shall 
refer to the individual who makes the 
decision on a given tria] as the 
"leader," although he can exercise 
only a few of the attributes commonly 
associated with the leadership role. 
'The broad purpose of the research is 


!'This research was supported by the 
System Development Corporation where the 
senior author is a consultant to the Decision 
Processes Staff of the Research Directorate, 


g success and the likelihood of being 
aker) of a 3-man group. 
based on extensions of concepts used in mathematical 


BINDER 
University 
STANLEY J. TEREBINSKI 


ment Corporation 


was the investigation of the relation- 


voted 
Markov models, 
learning theory, 
Each trial of the experiment be- 
group vote, and ended after the 
r the group. Obtained and pre- 
voting shifts, asymptotic leadership 
g trends. 5 different reinforcement 
these groups were there major dis- 
ected results, 


k. 
by 


the investigation of the relationship 
between decision-making success and 
the likelihood of leadership selection 
in subsequent trials, using a mathe- 
matical model as the basis for analysis, 

The model under consideration is 
Markovian. Let X ; Y, Z designate 
each of the three group members, let 
7i,» be the probability that the deci- 
sion of player i will be reinforced on 
Trial x if he is leader, and let X; YjiZ, 
represent the states of the Markov 
process, where i = y, z; j =x, 2; 
k=x,y. For example, the state 
X,Y.Z, represents the condition where 
X designated Y as his leadership 
choice, Y designated Z, and Z des- 
ignated Y. In this particular case, Y 
would become the elected leader. A 
random selection of leader was neces- 
Sary in such states as Xy,Y.Z, (which 
would occur by chance in i of the 
trials). 

We assume that the process of 
changes over trials in leadership vot- 
ing involves probabilistic learning 
with reinforcement depending upon 
the correspondence between the lead- 
er’s decision and that designated as 
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correct by the E. For this learning 
we assume a one-element pattern 
model with an axiom structure as 
summarized in Table 1 under the 
column ‘‘8-State.”’ 

Letting m:n — 7: for all z, the 
transition matrix for this eight-state 
Markov process is readily obtainable: 
For example, the probability of the 
transition from X, Y.Z. to X, Y.Z. is 
mz + (1 — e)?(1 — s) while the prob- 
ability of the transition from X.Y.Z, 
to X, Y;Z, is beL JT 7:)]. 

Since an eight-state Markov chain 
is cumbersome, it is convenient to 
introduce certain simplifying assump- 
tions in order to reduce the number of 
States. The alternate axioms are 
summarized in the column ‘‘3-State” 
of Table 1. The transition matrix 
for the three-state process is shown in 
Table 2. 


METHOD 


Subjects.—A total of 102 male and female 
college undergraduates was used. These were 
recruited from various schools in the Los 
Angeles area and paid at an hourly rate. 

A pparalus.—Each S sat at a table enclosed 
On three sides by heavy curtains which served 
to prevent Ss from seeing the stimuli and 
responses of others. On the table before each 
Swas an 83 X 11 in. cathode-ray tube display 
and a console with response buttons. Char- 
acters on the display tube were generated from 
a dot matrix. There were 24 such units in the 
experimental room with the necessary auxi- 
làry equipment that made it possible to run 
eight groups (three Ss to a group) simultane- 


TABLE 1 


ALTERNATIVE AXIOM SYSTEMS 


Probability of 
Shift of Member's 
Contingency Pattern 


8-State | 3-State 


I The man designated by 
a particular member be- 
comes leader. His de- 
cision is 

A Reinforced 0 0 
B Not reinforced c c 


II The man designated by 
a particular member 
does not become leader 
but the member himself 
becomes leader. The 
leader's decision is 

A Reinforced 0 iS 
B Not reinforced 0 E 


The man designated by 
a particular member 
does not become leader 
but the other man he 
could have designated 
does. The decision is 

A Reinforced 

B Not reinforced 


II 


1 
(1—e) 


on 


ously. Separate sets of response buttons were 
used for the leadership vote and for the 
leader’s decision. Since Ss were not permitted 
to vote for themselves, the button that would 
indicate a self-vote was covered. Indicator 
lights on the consoles were used to show 
voting and decision phases. 

A Philco S-2000 computer, located in 
another room, generated the displays, timed 
the trials, recorded responses, and produced 
data print-outs following the experiment. 


TABLE 2 
TuREE-SrATE TRANSITION MATRIX 
x Y A 
2c Sg We 5c gg 
x net in (1- E43) a-(&-5) a-x( 1-5 


r pp-a) fee $e) 4) 
MEE ER a-za( ¥-§) s a-e)(1- © + 22) 
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(b) Lar. decision 
phase (Non-leader's 
display) 


(a) Initial voting 
phase 


CHOOSE 
LEADER 


MEMBER 2 


MEMBER 2 
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(c) Ldr. decision 
phase (Leader's 


(d) Later voting 
display) 


phases 
LOR WRONG 


LDR 3 
PREDICT A OR B 


GROUP g 
RIGHT 
000 


MEMBER 2 


MEMBER 3 


Fic. 1. Appearance of displays. 


Three sets of punched cards, which were fed 
into the computer, controlled the reinforce- 
ment proportions of each group member when 
he acted as leader, Each set of punched cards 
consisted of a random sequence of reinforce- 
ments and nonreinforcements. 

Procedure.—At the start of the experiment 
the information contained in Fig. 1a was on 
the scope for Ss (note that the illustration is 
for the member whose number was 2; the 
member’s number always appeared at the 
bottom of the display). This introduced the 
first or voting phase of Trial 1 which was 
defined as the period during which Ss could 
vote for a leader. The Ss were instructed to 
vote for one of the other two members of their 
group for leader who would in turn make a 
decision for the group as a whole. They were 
motivated by statements to the effect that the 
goal of each group was to make a high score 
(percent “Right” decisions) and that only the 
leader, as the group representative on any 
trial, had the Opportunity to improve the 
score. A failure to vote on a given trial 
made the member ineligible to be selected as 
the leader on that trial, 

The Ss were assigned to the booths ran- 
domly and care was taken to prevent them 
from determining the other members of their 
groups during the experiment. 

During the first 25 trials Ss were given 6 
sec. to vote for a leader, Beginning with the 
twenty-sixth trial the entire time cycle was 
speeded up and the time allowed for voting 
was reduced to 43 sec. 

Following this voting phase came the 
decision phase of the trial, during which only 
the designated leader responded. Two 
responses were possible for the leader during 
this phase, predicting that either an A or aB 
was the correct decision by pushing the 
appropriate decision button on the console. 
The display which appeared during this phase 
on the scope of the two members who Were not 
leader on the trial may be seen in Fig. ib. 


The display of the leader for this phase 
differed only in that it contained instructions 
to predict A or B, in addition to the other 
information (see Fig. 1c). The leader was 
allowed 4 sec, to Push the A or B button 
during the first 25 trials and 3 sec. thereafter. 

The trial ended When the displays of the 
decision phase were removed from the scopes. 
the next trial was 
signaled by the appearance on all scopes of the 
i Fig. 1d. 


There was a delay of about 21 sec. between 
the end of a given phase and the disappear- 
ance of the scope information appropriate for 
that phase. This delay was necessitated by 
computer and Programing characteristics, 

The Ss were run for 300 trials, which took 
about an hour, and then given a 10-min. rest. 


During the rest period Ss were cautioned not 
to discuss the 


composition by group 
Performance scores, Another 300 trials 


Random selection of a leader was made 
by the computer using a random number 
generator whenever: (a) all members voted 
and the result was a tie, (b) one member did 
not vote and the other two voted for each 
other or both voted for the ineligible member, 
or (c) none of the members voted, 


_The groups were run under five different 
reinforcement conditions, where a condition 
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TABLE 3 
EXPERIMENTAL CONDITIONS 


Reinforcement Cond. Number of Groups 


975 8 
951 5 
753 10 
777 ó 
333 5 


is defined by the set of z;'s for the given group. 
Table 3 shows the sets of reinforcement 
Probabilities (conditions) used and the num- 
ber of groups run under each, Reinforcement 
Cond. 975, for example, means that one 
member of the relevant group was reinforced 
(display indicating "Ldr Right") with a prob- 
ability (p) of .9 for his decisions when leader, 
another member was reinforced with $ = .7, 
and the third member with p —.5. For 
Reinforcement Cond. 777 and 333 the predic- 
tions of all three members were reinforced 
With the same probability, .7 or .3. 
Reinforcement schedules were randomly 
determined in advance under the constraint 
that the given proportion hold exactly in 
Sroups of 20 trials. When a leader was 
Scheduled for reinforcement (or nonreinforce- 
Ment) it made no difference whether his 
response was actually A or B. A difficulty 
only arose when a leader failed to decide on a 
d trial, so that he was considered "Wrong" 
Ca er what his reinforcement schedule 
Es ed for. However, compensation was ac- 
mplished by designating as "Right" the 
very next decision he made as leader which 
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was scheduled to be designated as “Wrong.” 
Thus, it was possible to maintain a constant 
reinforcement proportion except toward the 
end of the experiment when an automatic 
"Wrong" might not be followed by a sched- 
uled “Wrong.” 

Within all reinforcement conditions there 
was a counterbalancing of member’s number 
(1, 2, or 3) with reinforcement probability to 
compensate for any response bias. The un- 
equal numbers over the various conditions 
stemmed from equipment breakdowns as well 
as problems of S availability. 


RESULTS 


The relative frequencies of shifts in 
pattern conditions on given trials as 
functions of the status of the leader 
and the correctness of his decision 
on immediately preceding trials are 
shown in Table 4. The frequencies 
were computed over all Ss in each of 
the reinforcement groups. The data 
for cases in which S failed to vote or 
voted for himself on one of the 
relevant trials were ignored in the 
analysis for that particular trial. 
Erroneous voting occurred about 5% 
of the time. 

The parameter can be estimated 
from I B or III A (Tables 1 and 4) for 
the eight-state process and from IB 
or III B for the three-state process. 


TABLE 4 
CONTINGENT VOTING BEHAVIOR OVER SS WITHIN REINFORCEMENT GROUPS 


Proportion Shifting Vote from Trial n to Trial (1 + 1) 


Contingency 
777 Groups | 333 Groups | 951 Groups | 753 Groups | 975 Groups Aree k 
TELA 
A .232 .083 .097 A13 62 
B 5 1390 1365 1325 “406 “405 
ut 113 184 119 192 
A .256 j i : t 
B F^ .271 .246 .234 .192 :249 
7 500 499 319 439 
A 431 F P : F 
B “308 1254 171 1197 "172 “220 
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Because of the desirability of using the 
same value in both matrices, I B was 
chosen for this purpose. 
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The formula for the stationary 
absolute probabilities of the three- 
state matrix is as follows: 


(1 — r) (1 — r) 


uy 


where 7, j, k = X, Y, Z but iz j 
7k # i. However, it was not pos- 
sible to obtain a general formula for 
the stationary probabilities of the 
eight-state matrix. Consequently, 
these probabilities were obtained by 
the process of raising the transition 
matrix to a high enough power for the 
necessary stability. The asymptotic 
proportions of leadership designation 
as a function of the probability of 
reinforcement for each member were 
derived from the last 200 trials of the 
experiment. These obtained values, 
together with the values expected on 
the basis of the stationary absolute 
probabilities of the three- and eight- 
state processes, may be seen in Table 
5. The obtained values were calcu- 


(l—m)—-)+A—-—al—7)+0—-mda— Tk) 


lated on two bases: first, using all 
trials no matter how the leader was 
selected, and second, excluding trials 
on which one or more members either 
did not vote or voted for themselves. 
For comparison purposes, the propor- 
tions expected on the basis of match- 
ing are also shown in Table 5; match- 
ing implies leadership selection at a 
relative frequency proportional to the 
probability of reinforcement, 

The obtained asymptotic propor- 
tions (last 200 trials) for the states of 
the eight-state process, together with 
the predicted values, are shown in 
Table A.? 


? Tables A and B and Figures A, B, and C 
have been deposited with the American 
Documentation Institute. Order Document 


TABLE 5 
ASYMPTOTIC PROPORTIONS Or LEADERSHIP Status 
Obtained Predicted 
Groups Reinforcement 
All Trials [With Exclusion 3-State 8-State Matching 
975 9 .65 .67 .65 .65 A3 
A 23 .22 E .22 .33 
5 419 ll 13 A3 .24 
951 2 43 «15 15 76 .60 
5 El 19 45 E .33 
A .06 .06 10 .09 .07 
753 E .66 rol 49 .49 AT 
5 .26 22 -30 .30 33 
3 .08 .07 21 321 .20 
7 4 .33 34 .33 .33 , 

17 7 32 31 "33 33 3 
E .35 E .33 .33 .33 
3 .28 .28 33 339. .33 
333 3 736 .36 .33 33 .33 
3 .36 .36 3 33 .33 
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Trends of leadership selection 
in 975 groups. 
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"Trends of leadership selection 
in 951 groups. 
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FiG.4. Trends of leadership selection 
in 753 groups. 


..Now weturn to the learning process. 
igures 2 to 6 show the trends of 
leadership selection in blocks of 40 
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Fic. 6. Trends of leadership selection 
in 333 groups. 


taken over all groups given a specified 
set of z's. These curves are based on 
the proportions each member was 
leader over the trials. The theoretical 
learning curves, based on the eight- 
state matrix, are also present in the 


figures. 
DISCUSSION 


While there would seem to be no rea- 
sonable basis for choosing between the 
eight-state and three-state matrices on 
the basis of asymptotic relative fre- 
quencies (Table 5), Table 4 does present 
relevant distinguishing data. Thus, 
using Table 1, in addition to Table 4, 
we note the following differences be- 
tween the processes: the eight-state 
assumes the equality of the probabilities 
of shifts of patterns under Contingencies 
IB and IIIA, while the three-state 
assumes equality for the probability of 
shift under I B and the probability of 
remaining conditioned under III B; the 
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eight-state assumes no shift under Con- 
tingencies II A and II B, while the three- 
state assumes a probability of .5 for 
shift under both these contingencies; the 
probability of shift under Contingency 
III A is c for the eight-state and 1 for the 
three-state; and finally, the probability 
of shift under Contingency III B is 0 
for the eight-state and (1—c) for the 
three-state. In each of these compari- 
sons the axioms of the eight-state process 
are more closely in accord with the data 
(using the averages in Table 4) than 
those of the three-state process, and in 
most cases considerably so. The relative 
frequencies of shift under Contingencies 
IIA and IIIB (.439 and .220, re- 
spectively) are particularly telling. Ac- 
cording to the eight-state process, the 
former should equal c (= .405 for I B) 
and the latter 0, while for the three-state, 
the corresponding values are 1 and (1—c) 
which is equal to .595. 

It may understandably be argued that 
even the eight-state model does not 
provide a very impressive fit in those 
cases where the theoretical probability 
is zero. One can only reply that the 
balancing of positive and negative dis- 
crepancies from the predicted value is 
not possible when the theoretical proba- 
bility is zero and that nothing more than 
a first (and reasonable) approximation 
is claimed for the model. Moreover, the 
magnitude of the discrepancies from the 
p = 0 values is within the range found 
by other investigators in similar research 
areas; see for example Suppes and 
Atkinson (1960). 

While the fit between obtained and 
predicted proportions of leadership des- 
ignation (Table 5) as well as the overall 
theoretical and actual learning curves 
(Fig. 2-6) are generally close, a con- 
spicuous set of failures occurs with 
Groups 753 in both asymptotic results 
and learning trends. A few other 
characteristics of Table 5 should be 
mentioned before these discrepancies are 
pursued further: First, the two processes 
predict almost identical asymptotic pro- 

portions, and second, the use of the data 
from all trials provides an excellent 
approximation to the more accurate (in 
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terms of model axioms) analysis ex- 
clusively in terms of trials on which all 
Ss voted. 

The discrepancies for Groups 753 in 
Table 5 (as well as in Table A) clearly 
involve the choice of the m = .7 member 
as leader (from States 755737 and 735737) 
at the expense of the other states. 
Table B (see Footnote 2) contains an 
analysis of leadership choice among the 
753 groups separately in an attempt to 
determine the generality of the dis- 
turbance. Seven of the 10 groups show 
marked overshooting in the choice of the 
-7 member as leader, and only two groups 
show proportions within reasonable dis- 
tances of the prediction (.49). And in 
the two cases of adequacy for m = .7, the 
predictions for m = .5 and v = .3 are off 
by considerable margins. The data 
surely indicate that the miss between 
obtained and predicted (from the model) 
results in the 753 case is not an idio- 
syncrasy of one or a few groups which, 
if disregarded, would lead to adequate 
fit for the averages of the remaining 
groups. 

In the comparison of predicted and 
obtained asymptotic proportions for the 
states of the eight-state process (Table 
A), it is interesting that the major source 
of discrepancy for the two other sets of 
groups where members did not have 
equal reinforcement probabilities (that 
is, Groups 975 and 951) is the tendency 
for the member getting reinforced at 
the highest rate (in these cases 7 = .9) 
to vote more frequently than expected for 
that member of the two remaining who 
has the higher reinforcement rate: that 
15, in the case of Groups 975, 9 votes too 
frequently for 7 rather than for 5; and 
in the case of Groups 951, 9 votes for 5 
rather than for 1 more than expected. 
The high preference of 9 for choosing the 
individual most likely to get reinforced 
may stem from an unusually strong 
motivation for group success. It should 
be noted that this preference does not 
affect the relative frequencies of the 
alternate conditions of leadership, so 
that we find the close fit between pre- 
dicted and obtained proportions (Table 
5) for these groups. 


MATHEMATICS OF SMALL-GROUP LEADERSHIP 


Another way of evaluating the data is 
to consider that each S was faced with a 
discrimination problem which was more 
or less difficult depending upon the 
reinforcement ratios of his group as a 
whole and upon his specific reinforcement 
ratio within the group. The differential 
reinforcement any member received for 
voting for one member as against the 
other, and whether his own decision 
reinforcement was above or below the 
chance level of .5 might well have in- 
fluenced voting behavior. 

Members of the 975 and 753 groups 
had a more difficult discrimination to 
make than members of the 951 groups. 
Within those three groups, the members 
with the medium reinforcement ratio 
should have less difficult discriminations 
than the members receiving the high or 
low reinforcements. However, when the 
differences in reinforcement ratios are 
relatively large, as in the 951 groups, all 
members have a relatively easy dis- 
crimination task. 

By recasting the data in Table A to 
show differential voting behavior of 
members of the various groups rather 
than occurrence of states it is possible to 
examine the effects of these factors. This 
has been done in Table 6. 

This table was constructed by sepa- 
rately summing up the proportions for 
the four states by reinforcement member 
where he voted for the more successful 
member and for the four states where he 
voted for the less successful. Assuming 
that the member who is reinforced most 
frequently for his decisions has the 
greatest motivation for group success, we 
Would expect overshooting toward higher 
reinforcement members to show up most 
clearly in the case of Groups 951. This 
is confirmed in Table 6 by the large 
Overweighting of the preference of the 
high-reinforcement member in favor of 
the medium- (rather than the low-) 
reinforcement member for 951 as com- 
pared with 975 and 753. That a similar 
tendency may not be present for medium- 
and low-reinforcement members 1s 1n- 
dicated by the relatively close fit between 
their obtained and predicted proportions 
in Groups 975 and 951. 
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TABLE 6 
ASYMPTOTIC VOTING PROPORTIONS 
Groups | Memb Other i 
p er | Members | Obtained | Predicted 

9 zi 446 | .587 

5 .254 443 

975 7 9 185 .769 
Ps) 215 .231 

5 9 775 .720 

ef .225 .280 

9 0 .909 .565 

aA 091 435 

951 E .9 .853 .851 
x 147 149 

E .9 -191 .809 

5 209 191 

4 5 .587 554 

3 A3 4446 

753 5 i .865 .654 
3 435 .346 

3 d. .136 .603 

5 .264 397 


Again, 753 turns out to be pathological. 
The high-reinforcement member provides 
a reasonably good fit between obtained 
and expected proportions while the fit is 
very poor for the other two members. 
It is not readily apparent how the dis- 
criminations for the members of 753 are 
more difficult than those of 975. 

One other aspect of the differential 
discriminations among reinforcement 
groups is indicated by the curves of Fig. 
A, B, and C (see Footnote 2) which show 
the voting preferences of members in 
blocks of 40 trials throughout the 600 
trials. The members of 951 seem to 
arrive at their asymptotic values earlier 
than the others. 

It is obviously the case that the model 
predicts a faster rate of acquisition than 
is actually achieved. Considering the 
simplicity involved in the basic one- 
element pattern model, this result is not 
surprising. The rate at which a transi- 
tion matrix reaches a stable state is a 
function of the accessibility of the states 
to each other and the variability of the 
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transition probabilities as well as the 
initial probability vector (see Doob, 
1953). The theoretical learning process 
could be slowed down in the present case 
by shifting from the one-element pattern 
model to a multielement model and, also, 
by allowing for guessing states and states 
leading to no response. Future work will 
involve such effort. 

It should be noted that only the group 
performance success score (percent 
"Right") was displayed. The Ss had to 
calculate and remember the success 
ratios for the various group members 
without assistance. A separate score for 
each player might lead to even better 
correspondence of voting behavior to the 
model. 

One modification of the experimental 
procedure which would lend itself readily 
to pattern considerations and might well 
modify the rate of approach to asymptote 
is to have Ss vote for leader onceevery N 
trials, where N is large enough so that 
estimates of success rates of each member 


when leader could be made rapidly and 
accurately. 
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The effects of differential ratio of value, exposure time, and number of 


categories upon the detection an 
a visual search task. The differentia 
2:1, 4:1, 8:1, and 16:1; 
0.5, 1.5, and 2.5 sec. ; and the number 
8, 10, and 12 letters. 
identifications, initial responses, 
symbols than of lesser value. 


an 


d report of letters were investigated in 


l ratios of value of the letters were 


the exposure times of the stimulus slides were 


s of categories on a slide were 4, 6, 


The results indicated that Ss made more correct 


d false reports of higher value 


Further, the percentage of correct 


identifications and percentage of initial responses of higher value sym- 


bols varied with exposure time and ratio. 
ctive recall from short-term memory storage. 


to sele 


A study by Teichner, Reilly, and 
Sadler (1961) attempted to distinguish 
between perception and memory using 
a visual search task. The stimuli 
were slide projections which were ex- 
posed for a short period of time. Each 
slide contained a number (load) of 
different letters (categories) and re- 
peated each category a number of 
times (density). The task of each S 
was to search, detect, and report the 
different categories which were dis- 
played. It was assumed that percep- 
tion included the search, detection, 
and storage of information (memory 
Storage) which occurred during the 
exposure of the slide. Conversely, 
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These effects were attributed 


the concept of short-term memory 
was assumed to include the retrieval 
and report of letters from storage fol- 
lowing the cessation of the slide. It 
was assumed further that the relative 
differences in number of letters per- 
ceived and the number reported may 
be investigated by varying theamount 
of information to be remembered 
(memory load), i.e., the less the mem- 
ory requirement, the closer the ap- 
proximation to pure perception or de- 
tection. The results of this study 
suggested that humans can receive 
more information than they can recall 
and that the percentage of correct 
identifications was inversely propor- 
tional to the number of categories 
(load). In general, these results 
agreed with the findings obtained with 
tasks which did not involve search 
(Anderson & Fitts, 1958; Sperling, 
1960; Woodworth & Schlosberg, 
1954). Recently, Teichner and Sadler 
(1962) further found that accuracy in 
the search task was a negatively 
accelerated, increasing function of 
exposure time with an asymptote at 
about 2.5 sec. 

The present experiment was de- 
signed to extend the work of Teichner 
and Sadler (1962) by investigating the 
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effects of differential value of cate- 
gories upon S’s recall. More specifi- 
cally, the experiment was concerned 
with the relative differences in accur- 
acy of performance as differential 
value, memory load, and exposure 
time were varied. Investigations of 
decision-making situations (Tanner & 
Swets, 1954; Taub & Myers, 1961) 
suggested that in order to maximize 
performance, S’s responses should 
vary directly with the relative value 
of the categories. 


METHOD 


Apparatus.—The stimuli consisted of 50 
slides with white letters on a black back- 
ground, The slides varied in number of 
categories (load) with each letter of the 
alphabet representing a category. The five 
levels of load were 4, 6, 8, 10, and 12 letters 
per slide. Each level of load was replicated 
10 times with a different random sample of 
the alphabet. Density, the number of times 
a letter was repeated on each slide, was always 
constant at 2. To allow fora comparison of 
values, a distinction was made between the 
first and second halves of the alphabet. Each 
letter from A through K was worth one unit of 
value, while the letters P through Z had a 
different value. In order to provide a break 
between the two halves of the alphabet, the 
letters L, M, N, and O were not used on any 
of the slides. This provided two populations 
of category value, each containing 11 letters, 
All slides were designed to contain an equal 
number of letters from both halves of the 
alphabet. A description of the method of 
construction of the slides and a sample slide 
have been presented elsewhere (Taub & 
Teichner, 1963). 

A Revere automatic slide projector with an 
externally mounted shutter presented the 
slides and two electronic timers controlled 
the exposure times and interslide intervals. 
The projected size of the slide was 40 X 60 in., 
while the projected size of the letters was 1 in. 
The Ss sat in student arm-type chairs, in 
groups of one to four, such that the average 
distance from their eyes to the projected 

letters was 96 in. The Ss performed under red 
illumination provided by two 25-w. light bulbs 
and reported their answers on prepared data 
sheets. 

Procedure.—The 120 Ss were randomly 
assigned to 15 equal groups. The design is a 
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$ X 4 X 5 X 2 factorial with Exposure Time 
and Ratio of Value as "between effects" and 
Load and Category Value as “within effects." 
The exposure times were 0.5, 1.5, and 2.5 sec. ; 
the four differential ratios of value were 2:1, 
4:1, 8:1, and 16:1. Three equal-value (1:1) 
groups were run to be used in an evaluation 
of initial biases to the two halves of the 
alphabet. Load refers to the number of 
different categories per slide (4, 6, 8, 10, and 
12), while Category Value refers to the two 
populations of categories which were des- 
ignated as high and low value for each of the 
ratios. 

Each S served for approximately 40 min. 
During the first 15 min., Ss received practice 
in seeing slides, in writing answers during the 
interslide interval, and in differentiating be- 
tween the halves of the alphabet. Allinstruc- 
tions for the experimental period were given 
immediately following practice. The E ex- 
plained that points would be received for each 
letter reported correctly and that the number 
of points received for any particular letter 
was equal to the value assigned to the category 
Population in which it was contained. For 
example, in an 8:1 ratio group, if the letters 

-K were designated as the high-value 
categories and P-Z were the low-value cate- 
gories, then an S would receive 8 points for 
each letter within A-K that was correctly 
reported and 1 point for each correct report 
of the letters P-Z. The only restriction 
placed on the report of letters was that points 
would be received only for each different 
category that was reported. This restriction 
was necessary since each particular letter ona 
slide was always repeated twice. The E 
further explained that there was no Penalty 
or loss for incorrect or false reports. ‘That is, 
S was instructed to report each detection, 
even if some uncertainty was present, Al- 
though Ss knew the values of the categories, 
they were given no knowledge of the load, of 
the frequency of occurrence of categories, or 
of results, 

To eliminate possible biases in favor of the 
first or second half of the alphabet, the ratios 
of value were counterbalanced for the two 
halves. Thus four Ss within each group of 
eight had the first half of the alphabet as the 
higher value Categories, while the reverse was 
true for the other four. All Ss were Presented 
with all 50 slides in the same random order, 
A 10-sec. interslide interval was provided to 
permit a report of the letters Which were 
detected and recalled. Following the ex- 
perimental period, each .S was asked to record 
the method employed to search the slide. 
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Subjects—One hundred and twenty male 
volunteers, enrolled in the summer session at 
the University of Massachusetts, served as Ss. 
Each of the eight Ss within a group competed 
for a monetary award of $5.00 and two awards 
of $2.00. 


RESULTS 


The data were analyzed in terms of 
three measures: (a) the percentage of 
high- and low-value categories which 
were correctly reported, (b) the per- 
centage of times that the initial report 
was a high-value letter, and (c) the 
number of false reports of high- and 
low-value letters, i.e., the report ofa 
letter when it was not displayed. The 
analyses of these measures were con- 
cerned with the effects of differential 
value and did not include the results 
for the 1:1 groups. 

The results for the percentage cor- 
rect for each condition were obtained 
from the combined number of correct 
reports over the 10 slides in each of the 
levels of category. An analysis of 
variance was performed on the arc 
sine of these data. The results of this 
analysis are summarized in Table 1 
where it may be seen that the main 
effects of Exposure Time, Load, Cate- 
gory Value, and Ratio of Value were 
all significant. Of the first-order in- 
teractions, Load X Exposure Time, 
Category Value X Exposure Time, and 
Category Value X Ratio of Value 
were significant effects. The rest of 
the first-order interactions and all of 
the high-order interactions were not 


significant sources of variance. Sum- 


maries of the major significant effects 
1, 2, and 3. 


are presented in Fig. 

Figure 1 presents accuracy of per- 
formance as a function of load and 
exposure time. Inspection of this 
figure suggests that the amount of 
benefit derived from increasing ex- 
Posure time was inversely related to 
the number of categories displayed, 
and that percentage correct varie 
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TABLE 1 


SUMMARY OF ANALYSIS OF VARIANCE FOR 
Accuracy OF REPORT OF HIGH- AND 
Low-VALUE CATEGORIES 


Source df MS F 
Born Ss 95 
Exposure time (E) 2 3624.25 
Ratio of value (R) 3 $0528 | ^ $os* 
E EN R A 209.85 2.05 
Within Ss 864 iem 
Load (L) 4 26821.52 | 1431.00 
Category value 
1 8552.10 88.17** 
LXV 4 53.90 1.91 
LXE 8 1411.52 75.31** 
LXR 12 8. 42 
VXE 2 909.20 9.37** 
VXR 3 318.83 3.28* 
LXVXE 8 43.85 1.55 
LXVXR 12 7.65 27 
LXEXR 24 11.50 61 
VXEXR 6 88.98 91 
LXVXEXR 24 21.31 75 
Ss XL/E XR 336 18.74 
Ss X V/EXR 84 96.99 
Ss XL XV/E 
XR 336 28.15 
* p «.05. 
**p «.01. 


inversely with the number of cate- 
gories displayed (load) and directly 
with the length of exposure. 

Figure 2 shows accuracy of per- 
formance as a function of ratio of 
value, with category value as a 
parameter. Although the 1:1 groups 
were not included in the analysis of 
variance, these data are included in 
the figure to illustrate the level of per- 
formance for equal-value categories. 
A comparison of the 1:1 condition 
with the rest of the groups shows the 
strong effect of the addition of differ- 
ential category value. That is, the 
accuracy of report for the equal-value 
categories was always greater than for 
the low-value categories and less than 
for the high-value categories. 

The significant main effect of Ratio 
of Value may also be inferred from 
Fig. 2. Although there was a slight 
increase in accuracy with the 2:1 
ratio, performance decreased as the 
ratio was increased to 8:1. The in- 
crease in percentage correct at 16:1 
did not bring performance back to the 
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level attained with the 1:1 and 2:1 
ratios. Inspection of the Category 
Value X Ratio of Value interaction, 
omitting the 1:1 data, indicates that 
the decrement in overall performance 
was due to a reduction in accuracy 
for the lower value categories as the 
differential ratio of value increased. 
Thus, the figure suggests that the 
interaction may not represent an 
increase in accuracy of high-value 
letters, but a decrease in performance 
for the low-value letters. Evidence 
for this suggestion was obtained from 
the results of analyses of the simple 
main effects of ratio of value at each 
of the category value populations. 
For the high-value categories, in- 
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creasing the ratio of value from 2:1 to 
16:1 produced no significant difference 
in accuracy, while performance with 
the low-value categories was sig- 
nificantly affected, F (3, 84) = 5.81, 
b «.01. A Newman-Keuls test 
(Winer, 1962) of this significant effect 
revealed that mean performance for 
the low-value categories at 2:1 was su- 
perior to performance at 8:1 (p < .01) 
and 16:1 (p < .05) and also that a 
greater accuracy was attained with 
the 4:1 ratio than with the 8:1 
(p < .05) ratio. 

Figure 3 presents the interaction of 
Category Value with Exposure Time. 
It can be seen from this figure that the 
difference in percentage correct be- 
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FiG. 1. Percentage correct as a function of exposure time with load 
(number of categories) as a parameter. 
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x HIGH VALUE CATEGORIES 
O———©_ LOW VALUE CATEGORIES 


4:1 8 


RATIO OF VALUE 


Fic. 2. Percentage correct of high- 
tween high- and low-value categories 
increased directly with exposure time. 
The divergence of the lines suggests 
that an increase in exposure time 
gives greater benefit to performance 
with high-value categories than to 
performance with the low. 

The initial response data were based 
on a total of 10 possible responses in 
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and low-value categories as a function of ratio of value. 


each condition. Unlike total percent- 
age correct, the data for the high- 
and low-value initial responses did 
not vary independently, i.e., the scores 
for the low-value responses were equal 
to the difference between 100% and 
the percentage of high-value responses 
made. An analysis of variance per- 
formed on the arc-sine transformation 


X HIGH VALUE CATEGORIES 
o LOW VALUE CATEGORIES 


TUE T 2.5 


EXPOSURE TIME (SECONDS) 


Fig. 3. Percentage correct © 


f high- and low-value categories as a function of exposure time. 
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Fic. 4, Percentage of initial responses of high-value categories as a function 
of differential ratio of value with exposure time as a parameter. 


of the initial responses to the high- 
value letters indicated significance for 
Exposure Time (E), F (2, 84) — 3.31, 
p «.05 and Ratio of Value (R), 
F (3, 84) = 6.08, p < .01. The main 
effect of Load and the interactions 
among E, R, and L were not signifi- 
cant sources of variance. 

Figure 4 shows the percentage of 
initial responses of higher value letters 
as a function of ratio groups with 
exposure time as a parameter. In- 
spection of this figure indicates that 
all groups made initial responses of 
high-value categories more than 50% 


TABLE 2 


SUMMARY OF SEARCH METHODS EM PLOYED BY 
EACH S ar EACH EXPOSURE TIME 


Methods of Search 


Exposure 
Time Restricted Enti: 
Area Seren Scanning 
0.5 26 6 8 
1.5 8 3 29 
2:5 3 0 37 


of the time and that the percentage of 
these responses varied directly with 
the differential ratio of value. Figure 
4 further suggests that the initial 
report of high-value letters was a 
maximum at 1.5 sec. exposure. A 
Newman-Keuls test showed that a 
significantly greater percentage of 
reports was made at 1.5 sec. than at 
0.5 sec. (p < .05). The difference in 
performance between the means at 
1.5 and 2.5 sec. and at 0.5 and 2.5 sec. 
was not significant. This suggests 
that 1.5 sec. may be either an opti- 
mum exposure time or an asymptote 
of higher value initial responses. 

In general, the total number of false 
reports was relatively small in com- 
parison to the number of correct re- 
ports or the number of categories 
omitted from reports. The mean 
number of false reports pooled over all 
50 slides was 9.51 indicating that Ss 
made approximately 1 false report for 
every 5 slides that they observed. An 
analysis of variance of the total num- 
ber of high- and low-value false reports 
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summed over all 50 slides revealed 
that the only significant source of 
variance was Category Value, F (1, 84) 
= 5.51, p < .05. The mean number 
of false reports of high-value letters 
was 10.39, while the mean for the low- 
value categories was 8.64. The non- 
significant Category Value X Expo- 
sure Time and Category Value 
X Ratio of Value interactions suggest 
that the differential false report re- 
sponse to category value was not 
affected by increasing exposure time 
or ratio. 

Table 2 presents a summary of the 
responses to the question concerning 
the method of search behavior. The 

restricted area” responses refer to 
statements indicating an inability to 
search more than a small portion of 
the slide, “Entire screen" refers to 
those statements which indicated that 
S tried to fixate on the whole screen. 
Finally, "scanning" included those 
responses which indicated that S 
searched or attempted to search the 
whole Screen in some systematic 
fashion. Inspection of Table 2 in- 
dicates that method of search was 
affected by the length of slide expo- 
Sure. With an exposure time of 0.5 
Sec., most Ss were able to search only 
a limited area of the slide, while at 1.5 
and 2.5 sec., the majority of Ss were 
able to systematically scan the slide. 
This suggests that the major effect of 
Increasing exposure time was to allow 
Ss to search a larger area and thus to 
detect more of the letters. 

Although the ratio of values for the 
two halves of the alphabet were 
counterbalanced to eliminate the pos- 
Sible effects of preferences, two anal- 
yses were run to determine if any 
Preferences did, in fact, exist. The 
dependent measures for these analyses 
Were the percent correct of first and 
Second half categories and the per- 
Centage of initial responses of cate- 
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gories within the first half of the 
alphabet. The 1:1 groups were in- 
cluded in these analyses since the 
objectives were to analyze the effects 
of category halves and not differential 
value. With percent correct as the 
dependent measure, the preferences 
for the two halves of the alphabet 
varied with both number of categories 
and exposure time, F (8, 420) = 3.84, 
p «.01. The increase in exposure 
time from 0.5 sec. to 2.5 sec. elimi- 
nated a first half preference with 4 and 
6 categories and produced a second 
half preference for 8 and 10 categories. 
The slight preference for the second 
half of the alphabet with 12 categories 
did not change. In general, the results 
for the initial reports indicated first 
half preferences with 4, 6, and 8 
categories, second half preferences 
with 10 and 12 categories, but re- 
vealed no consistent trends with 


exposure time. 


DISCUSSION 


The results of those effects which did 
not include differential value are in com- 
plete agreement with previous studies 
(Teichner & Myers, 1961, 1962; Teichner 
et al., 1961; Teichner & Sadler, 1962). 
Further, the data suggest that differential 
value did have an effect upon Ss’ reports. 
Evidence for this effect comes from the 
fact that Ss made more correct identifica- 
tions, initial responses, and false reports 
to the higher value categories than to 
those of lesser value. 

Although the results supported the 
hypothesis of differences in accuracy due 
to differential value, the ratio of value 
effect and the exact shape of the category 
value by ratio of value interaction were 
not expected. The findings of previous 
studies indicate that the number of cor- 
rectly reported stimuli remains invariant 
over a wide range of conditions (Sperling, 
1960) and that percentage correct re- 
mains fairly constant for any particular 
combination of exposure, load, and 
density (Teichner & Myers, 1961, 1962). 
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Using these studies as the basis for 
prediction, it was assumed that overall 
accuracy would not vary as ratio of 
value was increased, and that every 
increase in accuracy for the higher value 
categories would be accompanied by a 
corresponding decrease in performance 
for the lower value categories. The 
results indicate that although there was 
a decrement in performance for the lower 
value letters, there was no corresponding 
increment for the higher value categories. 
This loss in accuracy without any com- 
pensating increment resulted in a decre- 
ment in overall performance as the ratio 
of value was increased. Thus, the data 
suggest that an increase in differential 
ratio does not seem to facilitate reports 
of the more important targets, but rather 
inhibits the reports of ones of lower value. 
The only dependent measure which 
was not affected by increasing the differ- 
ential ratio of value and exposure time 
was the number of false reports. Since 
the present experiment provided an in- 
centive for the higher value categories 
and no penalties for false reports, the 
results were contrary to predictions based 
on the theory of visual signal detection 
(Tanner & Swets, 1954). On the other 
hand, the data agree with a study that 
found that a set to guess did not affect 
the number of letters reported in the 
visual search task (Teichner & Myers, 
1962). This discrepancy in findings sug- 
gests either that signal detection theory 
does not apply to the present search task 
in which each letter was clearly dis- 
criminable, or that the differences were 
caused by variations in response criteria. 
With respect to the response criteria, 
the present study provided no informa- 
tion concerning the number of categories 
to be presented and placed no restrictions 
on the number of responses, while the 
study of signal detection used forced- 
choice responses. It is possible that if a 
specified number of reports was required 
in the search task, then the number of 
false reports would vary with differential 
ratio of value. 
Studies by Teichner, Reilly, and 
Sadler (1961) and Sperling (1960) have 
demonstrated that there is a difference 


HARVEY A. TAUB 


between the amount of information 
received and stored, and the number of 
letters which can be recalled. Teichner 
and Myers (1961, 1962) assumed that 
every letter that is perceived enters into 
short-term memory storage and that 
recall represents the difference between 
storage and memory loss due to some 
process going on in the storage. This 
assumption implies that differential value 
may have affected either perception or 
recall. Since the present experiment 
used recall as the measure of perform- 
ance, the effects of selective perception 
of the categories and of selective recall 
from memory storage were not separable 
and the relative importance of these 
factors could not be evaluated. 

On the other hand, if a major selective 
process in perception was involved, then 
the size of the difference between high- 
and low-value letters should vary in- 
versely with length of exposure. That is, 
a perceptual set, which would make the 
higher value categories easier and quicker 
to identify (Bruner, 1957), would occur 
at those conditions where there is little 
time to detect and discriminate between 
the categories, but have little effect at 
longer exposures where all letters would 
be equally discriminable. Predictions 
from a selective recall point of view 
would be the opposite. That is, as 
exposure time increases and the rate of 
memory loading decreases, Ss would have 
more time to selectively code the cate- 
gories which are detected, Thus, a 
selective memory process explanation 
would predict increasing differences in 
response to value as exposure time in- 
creases. Since the results support this 
latter hypothesis, the evidence is in 
avor of a selective Process between 
memory storage and recall. The ex- 
planation in terms of a perceptual bias 
would also predict that frequency of false 
reports would vary directly with differ- 
ential ratio of value and inversely with 
exposure time. Although significantly 
more false reports were made to the 
higher value categories, the small number 
of reports and the lack of significant 
changes with exposure time and ratio of 
value suggest that selective perception 
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was not a major factor in the present 
study. Further, the verbal reports of Ss 
suggest that lengthening slide exposure 
did not seem to aid perception, but only 
allowed more letters to be detected 
through increased scanning. 

Finally, the choice and arrangement of 
the stimulus materials were designed to 
minimize the effects of a selective percep- 
tion. All letters were of equal size and 
were clearly visible and discriminable; 
and each slide contained an equal number 
of randomly selected and randomly 
positioned letters from the two halves of 
the alphabet. "Thus, it is assumed that 
an equal number of high- and low-value 
Categories was detected and that the 
differential effects were due to a selective 
decision which occurred after the devel- 
opment of memory storage. This selec- 
tive process is assumed to result in a 
coding of inputs and subsequent retrieval 
in favor of the more important categories. 

Although the results support the as- 
Sumption of a selective recall from 
memory storage, other findings in favor 
Of selective encoding in perception 
(Harris & Haber, 1963) indicate a need 
for research to evaluate the relative 
Importance of each of the hypothesized 
Selective processes. In order to ade- 
quately assess the selective aspects of 
Perception and memory, provision must 

€ made, within a single task, to separate, 
Control, and measure the amount and 
Complexity of information which is 
Perceived and stored and the amount of 
information which is retrieved and re- 
Called. This could be accomplished by 
adapting the method utilized by Teich- 
ner, Reilly, and Sadler (1961) for the 
Search task, or by presenting the stimuli 
Sequentially, Further, research is neces- 
Sary to determine the effects upon per- 
Ormance with reward for correct re- 
SPonses and penalties for errors of 
Omission and commission. Finally, re- 
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search on response criteria should be 
performed to relate the present findings 
to signal detection theory. 
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CONCEPT LEARNING AND VERBAL CONTROL UNDER 
PARTIAL REINFORCEMENT AND SUBSEQUENT 


REVERSAL OR NONREVERSAL SHIFTS! 


DANIEL C. O'CONNELL ? 
University of Illinois 


Concept learning was studied with partial or continuous reinforcement, 
reversal or nonreversal shift, and reinforcement for placement or reason. 
Ss placed cards right or left and gave trial-by-trial reasons; half were 
reinforced for placements, half for reasons. In acquisition, half re- 
ceived 100%, half 60% reinforcement, until each reached 50 reinforce- 
ments. Shift period was 80 trials; half the Ss were reinforced for 
reversal, half for nonreversal shift. In acquisition and shift period, 
PRE was found. Reinforcement for placement or reason differed 
significantly. Reversal or nonreversal shift gave inconclusive evidence 
concerning mediation theory. A theory of verbal control accurately 


predicted correct placements from verbalized reasons. 


Number of 


nonreinforced acquisition trials correlated negligibly with shift-period 


response measures. 


Investigations concerned with par- 
tial reinforcement of the stimulus 
dimension relevant during acquisition 
of a concept by human Ss have yielded 
rather complex results. Evidence re- 
garding the existence of a partial- 
reinforcement effect (PRE) remains 
in several respects negative or in- 
conclusive. 

Buss (1952) found /ess resistance to 
extinction after partial reinforcement 
(50%) than after continuous. He 
noted that the experiment differed 
from a standard extinction experi- 
ment in that the response was at- 
tached to a new stimulus while being 
extinguished on the first. Still he 
could find no obvious explanation for 
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the prolonged extinction after con- 
tinuous reinforcement. 

Sax (1960) utilized the same per- 
centages of reinforcement as Buss and 
four delays of reinforcement. His 
measure of habit strength at the end 
of acquisition was the number of 
trials needed to reach a criterion of 
perfect performance. Neither sched- 
ule of reinforcement nor its inter- 
action with delay of reinforcement 
proved to be significantly related to 
this measure. Retention after 2 wk. 
rather than a standard extinction 
procedure was the second measure of 
habit strength. Schedule of rein- 
forcement was found to be unrelated 
to this measure also. 

Erlebacher and Archer (1961) found 
that, as percentage of reinforcement 
increased (25%, 50%, and 100%) 
during acquisition, number of correct 
responses, number of errors, and total 
number of responses decreased. Their 
Ss were required to reach a eR, A 
learning defined by a criterion of 1 
20, or 40 consecutive correct erbe 
They considered a nonreversal shift 
analogous to an extinction conditio? 
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and hypothesized accordingly that 
number of perseverative errors would 
be inversely related to percentage of 
reinforcement. This second aspect 
of PRE was not confirmed, although 
there did appear a significant inter- 
action between percentage of rein- 
forcement and degree of learning, 
with perseverative errors as the re- 
sponse measure. During acquisition, 
however, increase in percentage of 
reinforcement was confounded with a 
decrease in number of reinforcements 
—a problem in partial-reinforcement 
studies to which Kanfer (1954) has 
called attention. In the present 
instance, it renders the negative re- 
sults and the already complex inter- 
action difficult to interpret. 

The analogy of reversal and non- 
reversal shifts to an extinction condi- 
tion makes the relative difficulty of 
the two types of shift relevant. 
Kendler (1960) has reviewed the 
evidence that the majority of adult Ss 
learn a reversal shift rather than a 
nonreversal shift. The former re- 
quires a shift only in the overt 
response and not in the postulated 
mediating response, and this accounts 
for its relative facility. Mediation 
theory also predicts positive transfer 
in a reversal shift as compared with 
a control, but Isaacs and Duncan 
(1962) have obtained negative results. 

It is also of interest to inquire 
Whether PRE can be found when 
reinforcement is applied to verbal 
rather than motor responses. 
design which can be adapted to test 
Such a hypothesis has been presented 
by Verplanck (1962). He found evi- 
dence of the dissociation of motor 
Card placements) and verbal (trial- 
by-trial hypotheses) responses under 
Partial reinforcement (60%) after 
acquisition under continuous rem- 
Orcement. Dulany (1962), on the 
Other hand, has presented a theory of 
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verbal control which predicts no dis- 
sociation under such conditions, and 
Dulany and O’Connell (1963) have 
presented evidence to the same effect 
from a replication of Verplanck’s 
research. 

In both the Sax (1960) and Erle- 
bacher and Archer (1961) studies, the 
failure to control for the number of 
reinforcements made the meaning of 
the negative results regarding PRE in 
the second phase of the experiments 
unclear. Moreover, Sax used a reten- 
tion measure, and Erlebacher and 
Archer used only a nonreversal shift. 
In this study, it is expected that PRE 
will be verified in both acquisition and 
shift period, with reinforcement con- 
tingent on motor or verbal responses. 
The negative findings of Isaacs and 
Duncan (1962) suggest the further 
hypothesis that reversal and non- 
reversal shifts will not prove signifi- 
cantly different. Finally, dissociation 
of placement and reason is expected 
in none of the conditions in conse- 
quence of the various reinforcement 


contingencies. 


METHOD 


Slimuli.—A set of 320 3 X 5 in. cards was 
used. The figures thereon differed along four 
dimensions: shape (20 different kinds), posi- 
tion (top or bottom of the card), color (red, 
black, blue, or green), and number (one or 
two figures per card). Order of presentation 
was the same for all Ss; each block of 20 cards 
had one card with each of the 20 figures, 5 of 
each color, 10 with figures in the top and 
bottom positions, respectively, and 10 with 
one and two figures, respectively. Within 
each block, order of presentation was ran- 
domized. 

One block of 20 cards served as a sample 
for Ss during instructions, and an extra card 
covered the deck. The other 300 cards were 
piled face up in order before S. To the left 
and right were receptacles for the cards. 

Subjects —The sample consisted of stu- 
dents in an introductory psychology course. 
The 20 Ss in each of the eight experimental 
conditions were randomly assigned. 


146 


Procedure.—Each S was scheduled in- 
dividually for an hour. He was seated at a 
table with E at his side and somewhat behind 
him. A screen hid E's data sheets. In- 
structions were as follows: 


The experiment you're helping me with 
is essentially a card sorting task; the cards 
can be sorted systematically into two stacks 
in a number of ways. You are simply to 
take each card in turn, decide whether to 
place it on the right or left stack, and give 
the reason why the card should be so placed. 
Tell me your reason in the following way: 
“— figures go to the right (left)." Mean- 
while place the card accordingly and take 
the next card. The blank space is to be 
filled in with a single word descriptive of the 
card. Your reason might be, for example, 
“Green figures go to the left," or "Ty. 
angular figures go to the right." What the 
real reasons are you must find out for 
yourself in the course of the experiment. I 
will reply with either "yes" or silence after 
each trial. You are to try to get as many 
"yeses" as possible, Here are some 
sample cards from the deck. Look through 
them to familiarize yourself with the type 
of card you will be dealing with during the 
experiment. Now to make sure you under- 
stand how the cards can be sorted into two 
piles, notice that the samples include green 
figures, red figures, black figures, and blue 
figures; single figures and double figures; 
top figures and bottom figures; and a number. 
of types of figures, or shapes. These are the 
dimensions you are to use in filling in the 
blank space in the formula when you give 
your reasons. 


The S's questions were then answered, 
examples of all possible logical reasons were 
given with the extra card, and S was reminded 
of the nature of his task. He then proceeded 
through the cards, pacing himself and without 
interruption, and filled Out a brief post- 
experimental questionnaire. 

Conditions.—Throughout the experiment, 
the P group (80 Ss) were reinforced (“yes” 
from E) for correct placements irrespective of 
their reasons; the H group (80 Ss) were rein- 
forced only if the reason for their placement 
was exactly correct. During acquisition, each 
group was subdivided into groups with 100% 
and 60% reinforcement, respectively. In the 
latter group, E's "yeses" were given in a 
randomized three out of each series of five 

correct responses (placements or reasons). 
Acquisition lasted until S had been reinforced 
50 times. Then all groups were shifted with- 
out notice to 100% reinforcement and sub- 
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divided into reversal- and nonreversal-shift 
groups for 80 more trials. 

During acquisition, correct placements 
were: cards with one figure to the right; cards 
with two figures to the left. During the shift 
period, correct placements for the reversal- 
shift group were: cards with one figure to the 
left; cards with two figures to the right. For 
the nonreversal-shift group, they were: cards 
with figures at the top to the right; cards with 
figures at the bottom to the left. 

Supplementary conditions.—An additional 
30 Ss, volunteers from Harvard and Radcliffe 
Colleges, were assigned to several of the 
experimental conditions in order to obtain 
further evidence regarding verbal control. 
From the instructions above, the sentence 
requiring that a single word be used, the 
sentence beginning “What the real reasons 
are," and the last two sentences were deleted. 
Twenty Ss were assigned to the P group and 
evenly subdivided into groups with 100% and 
60% reinforcement, respectively. The group 
with 100% reinforcement was given acquisi- 
tion and nonreversal-shift trials; the group 
with 60% reinforcement was given acquisition 
trials only. The remaining 10 Ss were 
assigned to the H group and given acquisition 
trials with 60% reinforcement. 


RESULTS 


Learning during acquisition.—A 
number of response measures indi- 
cated terminal level of learning in each 
of the four acquisition groups: number 
of Ss whose last 20 placements were 
all correct; number of Ss whose last 
20 reasons were all COrrect; mean 
number of correct placements in the 
last 20 trials for all Ss; and mean 
number of correct reasons in the last 
20 trials for all Ss. For all these re- 
Sponse measures, the differences be- 
tween the 100% and 60% reinforce- 
ment conditions were significant in 
both P and H groups. 

The binomial test was applied to 
the differences in number of Ss whose 
last 20 placements were all correct: 
for the H groups (39 and 14), p < .001; 
for the P groups (26 and 2), p « .001. 
The binomial test was similarly ap- 
plied to the differences in number of 
Ss whose last 20 reasons were all 
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TABLE 1 


MEAN RESPONSE MEASURES DURING SHIFT PERIOD 


7 Placements Rea: i 7 i 
Group N Correct Correct Ren ENS ES 
H-100-R 20 73.00 69.20 3.15 7.00 
H-100-N 20 74.70 71.25 3.10 3.05 
H-60-R 20 55.05 51.80 20.55 24.95 
H-60-N 20 55.35 35.20 26.40 18.25 
P-100-R 20 67.20 54.25 3.75 12.80 
P-100-N 20 63.90 50.35 7.70 8.30 
P-60-R 20 38.40 9.55 10.10 41.60 
P-60-N 20 49.60 19.75 18.40 22.60 


correct: for the H groups (35 and 9), 
P < .001; for the P groups (23 and 1), 
P « .001. 

The ¢ test was applied to the mean 
number of correct placements in the 
last 20 trials for all Ss: for the H 


groups (19.95 and 17.95), ¢ (38) 
= 447, p < .01; for the P groups 
(18.02 and 14.52), t (38) = 4.32, 
P < .01. The ¢ test was similarly 


applied to the mean number of correct 
reasons in the last 20 trials for all Ss: 
for the H groups (19.72 and 15.82), 
t (38) = 5.93, p < .01; for the P 
Sroups (14.95 and 5.95), ¢ (38) = 6.55, 
P< 01. 

Learning during shift period.—The 
Main response measures for the shift 
Period were: correct placements, cor- 
Tect reasons, perseverative reason 
errors, and perseverative placement 
errors. Means are presented sum- 
marily in Table 1. 

A 2X2 X2 factorial analysis of 
Variance for correct placements in- 
dicated two significant main treat- 
Ment effects: for the PRE, F (1, 152) 
= 93.09, p < .01; for the H-P effect, 
F (1, 152) = 16.14, p < .01. Parallel 
Tesults were found in the analysis 
Or correct reasons: for the PRE, 
T (1,152) = 77.03, p < .01; for the 
T-P effect, F (1, 152) = 40.67, p < -01. 
In both these analyses, Bartlett's test 
dicated heterogeneity of variance: 
X (7) = 26.18, p < .01; and x? (7) 


= 27.82, p < .01. Hence the level 
accepted as significant was .01 in both 
instances (Lindquist, 1953). 

In the analysis for perseverative 
reason errors, however, a transforma- 
tion (log of score plus one) was 
effective in eliminating heterogeneity. 
In this analysis, the significant results 
were: the PRE, F (1,152) = 69.97, 
p «.01; the H-P effect, F (1, 152) 
= 4.99, p < .05; the reversal-non- 
reversal effect, F (1,152) = 5.83, 
p < .05; and the triple interaction, 
F (1, 152) = 18.10, p < .01. 

The experimental design allowed 
only half as many perseverative place- 
ment errors in nonreversal as in 
reversal conditions. Hence a direct 
comparison of these conditions was 
not feasible, and two separate 2 X 2 
factorial analyses were required. In 
the analysis for reversal conditions, 
the two main treatment effects were 
significant: for the PRE, F (1, 76) 
= 52.26, p < .01; and for the H-P 
effect, F (1, 76) = 12.05,p < .01. In 
the analysis for nonreversal condi- 
tions, only one main treatment effect 
was significant: for the PRE, F (1, 76) 
= 63.27,p < .01. In the analyses for 
both reversal and nonreversal condi- 
tions, Bartlett's test indicated hetero- 
geneity of variance: x? (3) = 13.85, 
p < .01; and x? (3) = 20.61, p < .01. 
Again, the level accepted as significant 
was .01 because of the heterogeneity. 
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Similar comparisons were made by 
applying the binomial test to the 
number of Ss in the various conditions 
to reach 10 or more successive correct 
placements or correct reasons. In 
terms of both criteria, the PRE was 
significant (73 and 40; 69 and 34), 
P < .01, as was also the H-P effect 
(70 and 43; 69 and 34), p < .01. The 
frequencies for the reversal-nonre- 
versal effect (56 and 57; 55 and 48) 
were not significant. 

The number of Ss whose first shift 
in reason during the shift period was 
a reversal was significantly different 
in the H and P conditions (38 and 11) 
by the binomial test, p < .001. This 
result is hardly surprising, since 
learning of reasons during acquisition 
reached a higher level in the H than 
in the P conditions. The number of 
Ss in the H condition whose first shift 
in reason during the shift period was 
a reversal and nonreversal, respec- 
tively (38 and 41) was then compared 
by means of a 2X2 contingency 
table with the frequencies for these 
same reasons in the initial responses 
of the experiment (41 and 86). The 
shift-period frequencies were found to 
be significantly different from the ex- 
pected frequencies: x? (1) = 16.86, 
p < .001. 

Verbal control.—Miscategorizations 
(S named a figure erroneously, eg., 
called a green figure red) occurred on 
only 6, or .02% of the 29,874 trials of 
the experiment. Misplacements [&3 
placed card on one side while saying 
he was placing it on the other side) 
occurred on 58, or .19% of the trials. 
Misplacements for the 100% and 60% 
reinforcement conditions during ac- 
quisition were 25 and 10, respectively. 
However, the larger figure was con- 
tributed to by multiple misplacements 
of several Ss. The cumulative dis- 
tributions of 5s making various num- 

bers of misplacements in these condi- 
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tions were compared by means of the 
Kolmogorov-Smirnov two-sample test, 
and the differences proved nonsignifi- 
cant: D = .0875, p > .10. 

A formula for expected correct 
placements (Pg) was adapted from 
Dulany and O'Connell (1963). It 
relies on the following assumptions: 
Correct reasons control correct place- 
ments; Incorrect reasons control in- 
correct placements; Irrelevant reasons 
control correct and incorrect place- 
ments with equal frequency. Corre- 
lated reasons (e.g., reasons correct but 
for the addition of superfluous ele- 
ments) had proved an important com- 
ponent of the original formula, but 
were excluded from the adapted 
formula because Ss were free to use 
only a single word descriptive of the 
card in their reasons. This constraint 
was introduced in order to limit the 
experimental time for each S to 1 hr. 
and avoid nonlearners. Without such 
a constraint, Verplanck (1962) and 
Dulany and O'Connell (1963) had 
found it necessary to discard 17 and 
25 Ss, respectively. 

Correct and incorrect numerical 
reasons were coded as Hy and He, 
respectively; correct and incorrect 
position reasons as Hp and Hy, re- 
spectively. Irrelevant color and shape 
reasons were coded as Hg and H S, 
respectively. For conditions in which 
the correct reason was numerical, the 
adapted formula was: 


Ps — Hy + .5(Ho+ Hs 4- Hp + H3) 


For conditions in which the correct 


reason involved position, the adapted 
formula was: 


Ps = Hp .5(Hy + H,-- Ho + Hs) 


The ! test for related measures was 
applied to the differences between ex- 
pected and observed placements for 
each of the four acquisition and eight 
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shift-period conditions. In three ac- 
quisition conditions, the differences 
proved significant: for the 60% H 
condition, ¢ (39) = 3.62, p < .001; 
for the 100% P condition, : (39) 
= 3.68, p < .001; and for the 60% P 
condition, ¢ (39) = 4.70, p < .001. 
One of the shift-period conditions 
proved significant: for the nonreversal 
100% P condition, ¢ (19) = 2.11, 
P « .05. 

Nonreinforced trials during acquisi- 
tion.—For each condition, correla- 
tions were computed between number 
of nonreinforced trials during acquisi- 
tion and each response measure of the 
shift period: correct placements, cor- 
rect reasons, perseverative reason 
errors, and perseverative placement 
errors. Of the 32 correlations, 7 
Proved significant, and of these, 2 
were positive, 5 negative. They were 
distributed over conditions with no 
discernible regularity, even in the case 
of perseverative errors, for which the 
analogy to a standard extinction con- 
dition is most relevant. 

Supplementary | conditions.—For 
these data, the original formula for ex- 
Pected correct placements (Dulany 
& O'Connell, 1963) was applicable. 
~ifting the restriction to a single word 
In filling in the blank space made it 
Necessary to discard 10 nonlearners; 

ut it also made it possible for S to 
Verbalize reasons which, although too 
Specific to be scored as logically cor- 
Tect, could be expected to have a per- 
€ct positive correlation with correct 
Placements, The / test for related 
Measures was applied to the differ- 
ences between expected and observed 
Placements for the three supplemen- 
tary acquisition conditions and the 
9ne supplementary nonreversal-shift 
Condition. These were the very con- 
litions for which the ¢ tests proved 
Significant in the results of the main 
€xperiment, and yet none of the ¢ 
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tests in the present instance even ap- 
proached significance. 


Discussion 


Partial-reinforcement effect. — The 
present results clearly indicate PRE in 
both acquisition and shift period. Every 
one of the placement and reason response 
measures indicates the effect at the .01 
or .001 level of significance. During ac- 
quisition, terminal level of training is 
consistently higher under continuous 
than under partial reinforcement. Dur- 
ing the shift period, resistance to extinc- 
tion is consistently higher in the group 
which had partial reinforcement during 
acquisition than in the group which 
had continuous reinforcement during 
acquisition. 

Lawrence and Festinger (1962) have 
recently rejected the explanation of PRE 
in terms of contrast between acquisition 
and extinction trials and explained the 
phenomenon as a function of the total 
number of unrewarded trials. In the 
results of the present experiment, how- 
ever, there are no significnat correlations 
between number of nonreinforced trials 
during acquisition and any response 
measure for the shift period which dis- 
close such a systematic or consistent 
relationship. The few correlations which 
did prove significant reflect the trivial 
fact that individual Ss are consistently 
good or poor in their performance during 
acquisition and the shift period. It 
should be noted, however, that the 
similarity of shift periods to an extinction 
condition is analogous; the evidence 
found in this experiment cannot be ex- 
tended to a standard extinction condition 
in every respect without further evidence. 

Reversal and nonreversal shifts—The 
evidence of the present experiment does 
not lend support to the hypothesis that a 
reversal shift will yield faster or better 
learning than a nonreversal shift. Only 
the analysis for perseverative reason 
errors yields a significant reversal-non- 
reversal effect, but a study of the means 
in Table 1 shows that the largest reversal- 
nonreversal differences appear in the 
partial-reinforcement groups. Duncan's 
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multiple-range test shows that these are 
also the differences which account for 
the significant effect. In terms of the 
mediation hypothesis, this would mean 
that, where the mediator was learned 
poorly during acquisition, reversal and 
nonreversal shifts differentiate subse- 
quent levels of learning the better; where 
the mediator was learned well, there 
appear no significant differences between 
the reversal and nonreversal groups. 
Even this isolated significant result is, 
therefore, exactly the opposite of what 
the mediation theory would hypothesize. 

It is plausible to suggest that the 
failure to find significant differences be- 
tween reversal and nonreversal shifts 
might be a consequence of the different 
salience of the dimensions relevant in 
the two stages of the experiment. Since 
these dimensions were not varied, they 
were indeed confounded with the shift 
variable; the relevant dimensions for 
reversal and nonreversal shifts were al- 
ways numerical and positional, respec- 
tively. If position were a more salient 
dimension than number, the nonsignifi- 
cant result could be expected. But the 
initial responses of the experiment in- 
dicate no such salience. On the contrary, 
74 Ss gave numerical reasons as their 
first response, and only 19 gave positional 
reasons. Hence, the salience of the 
relevant dimensions would have inflated 
any difference in favor of reversal shift 
over nonreversal shift. 

The evidence does support the media- 
tion hypothesis, however, insofar as the 
number of Ss in the H conditio 1i whose 
first shift in reason was a reversal was 
significantly greater than expected in 
accord with the salience of these same 
reasons in the initial responses of the 
experiment. Apart from the salience of 
dimensions, we should expect only 11 
reversal-shift responses rather than the 
38 observed in the 79 first shifts in 
reason, if a reversal shift occurred only 
as frequently as each of the other six 
available (nonreversal) shifts. The pre- 
ponderance of reversals among these 
first shifts in reason is, therefore, all the 


more impressive. 
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Verbal control.—Dulany and O'Connell 
(1963) found that miscategorizations 
accounted for what appeared to be 
dissociation in Verplanck’s research. 
And the miscategorizations were due in 
turn to ambiguous stimuli. No mis- 
categorizations occurred in their control 
experiment with unambiguous stimuli. 
In the present experiment, miscategoriza- 
tions and misplacements do occur, but 
so minimally as to have little effect on 
the accuracy of the formula for expected 
placements. 

But the adapted formula does not 
accurately predict in all cases. The 
failures in prediction can well be ascribed 
to the inadequacy of the formula and the 
corresponding constraints placed upon 
the experimental design, rather than to 
any weakness inherent in a theory of 
verbal control as such. This suggestion 
is convincingly borne out by the results 
of the supplementary conditions. When 
Ss were allowed to verbalize correlated 
reasons, they did so on 27% of the trials. 
And when these correlated hypotheses 
were in turn taken into account in the 
original formula for expected correct 
placements, all the significant differ- 
ences between expected and obtained 
correct placements disappeared. It is 
clear that the adapted formula did not 
provide adequate evidence of verbal 
control precisely because Ss could enter- 
tain hypotheses which they were not 
allowed to verbalize. The constraints 
placed on verbalization served only to 
preclude the possibility of gathering ade- 
quate evidence regarding verbal control. 

In both acquisition and shift period, 
therefore, with reinforcement contingent 
on either verbal or motor responses, PRE 
can be found in concept learning. No 
dissociation due to reinforcement con- 
tingencies was found, and evidence re- 
garding the relative facility of reversal 
and nonreversal shifts was mixed. 
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VERBAL OPERANT CONDITIONING! 


PAUL W. DIXON axb WILLIAM F. OAKES 
University of Hawaii 


4 groups of 25 Ss constructed sentences using a verb and 1 of 6 pronouns 
presented on an index card in a series of 6 blocks of 20 trials Ina2 X2 
design, 2 of the groups, RN and RC, were reinforced for sentences begun 
with "I" or “we,” while the other 2 groups, UN and UC were not rein- 


forced. Groups RN and UN named colors between trials, while RC 


and UC had no such intertrial activity. 


After the conditioning trials 


an "awareness" interview was conducted. The reinforced groups in- 
creased significantly in number of “I-we” sentences constructed and did 


not differ in the amount of conditioning. 


The correlation usually found 


between “awareness” and conditioning scores was found in Group RC, 
but a nonsignificant correlation in the opposite direction was found for 


Group RN. 
different.) 


(The correlations for 


the conditioning, 


RN and RC were significantly 


It was concluded that the intertrial activity (color naming) 
interfered with the "awareness"-conditionin 


per se, thus suggesting 


g relationship, but not with 
an automatic strengthening 


effect of the reinforcement, not mediated by Cognitive processes, Ls, 


The 


"awareness," 


results were interpreted as not supporting the 


Dulany-Spielberger position on the relationship between "awareness" 
and conditioning, but rather às supporting the Verplanck-Krasner 


position. 


A question which has arisen from 
the research on verbal operant condi- 
tioning (VOC) has to do with the role 
of "awareness," usually indicated by 
retrospective verbal report, in such 
learning. The early investigators in 
this area (e.g., Greenspoon, 1955; 
Taffel, 1955) eliminated Ss from the 
data analysis whose interview results 
indicated "awareness" (of a response- 
reinforcer contingency), and thus 
implied that this learning had oc- 
curred without awareness. Verplanck 
(1962) and Krasner (1962) have held 
that there is no necessary relationship 
between awareness and VOC, i.e., 
that the effect of the reinforcer is 
direct, and not necessarily mediated 
by cognitive processes. 

a different analysis was made, 
Nec herein were obtained by the 
first author for his MA thesis under the 
supervision of the second author. This report 
was prepared by the second author. 
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Other investigators (e.g., Dulany, 
1961, 1962; Spielberger, 1962), pri- 
marily on the basis of repeated find- 
ings of a positive relationship between 
degree of "awareness" and amount of 
VOC for individual Ss, have ques- 
tioned the conclusion of the early 
VOC investigators, and, in contrast 
to the Verplanck-Krasner position, 
Suggested that VOC is necessarily 
mediated by cognitive processes, i.e., 
"awareness" of a response-reinforce- 
ment contingency (e.g., Spielberger, 
1962, p. 95). 

Dulany (1962) has presented a theo- 
retical network Which breaks down 
what others call "awareness" (which 
he conceives of as a self-instructional 
Set mediating the response in VOC) 
into three concepts: Reinforcement 
Hypothesis (RH), Behavioral Hy- 
pothesis (BH), and Behavior Inten- 
tion (BI), RH, BH, and BI are all 
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assessed by means of retrospective 
verbal report. 

Dulany (1962) presents results in- 
dicating that the higher the level of 
RH, BH, and BI reported by S, the 
greater the conditioning of "I-we" 
sentences, i.e., in the Taffel (1955) 
VOC procedure. In that same paper 
Dulany (1962) reported supporting 
evidence for his theoretical assump- 
tion that RH and BH should affect 
performance only through BI, viz., 
that the correlation between perform- 
ance and BI, BH, and RH fell off in 
the expected direction—.71, .57, .36, 
respectively, each significantly less 
than the preceding. 

The present experiment is designed 
to test certain assumptions in and 
predictions from the Dulany (1962) 
theoretical system. It is the feeling 
of the writers that the question of 
whether S's increase in rate of emis- 
sion of the reinforced class in the VOC 
situation is mediated by cognitive 
Processes (‘awareness’) or not is still 
not settled, in spite of the Dulany and 
O'Connell tour de force ("Does Par- 
tial Reinforcement Dissociate Verbal 
Rules and the Behavior They Might 
Be Presumed To Control")? in which 
they showed that the Verplanck 
(1962) results which cannot be ac- 
counted for within the Dulany (1962) 
theoretical network are probably arti- 
factual. It seems likely to the writers 
that the typical finding of a positive 
relationship between "awareness" and 

OC may be primarily due to the fact 
that such a simple and obvious con- 
tingency has been involved. The 
Taffel (1955) procedure, which Dulany 
(1962) used, requires that 5 construct 
Sentences that begin with one of 
Several pronouns presented on a card, 
and E reinforces whenever 5 begins 
the sentence with “I” or "we." The 


*D. E, Dulany and D. C. O'Connell, 
Personal communication, May 1963. 
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S need not be too perceptive to detect 
this contingency between E's rein- 
forcer and S's response in that situa- 
tion, provided he is given the op- 
portunity to formulate hypotheses 
regarding the contingency during the 
experiment. 

The rationale for this experiment 
involves using that same Taffel (1955) 
procedure, but interfering with .S's 
opportunity to formulate such hy- 
potheses between trials during the 
experimental session. Between trials 
some of the Ss are required to perform 
a color-naming task, which would 
presumably interfere with such hy- 
pothesis-formulating behavior. If the 
Spielberger-Dulany position is correct, 
it would be expected that such inter- 
ference between trials would reduce 
the possibility of conditioning in this 
situation, whereas if the Verplanck- 
Krasner direct-strengthening inter- 
pretation is correct, there should be 
no effect on the results with respect to 
conditioning, but for Ss performing 
the color-naming task between trials 
the positive relationship usually found 
between "awareness" and condition- 
ing scores should not be found. Fur- 
ther, RH, BH, and BI should not be 
related to conditioning scores in the 
way that Dulany found them to be. 


METHOD 


Subjects—The Ss were 100 students from 
introductory psychology classes at the Uni- 
versity of Hawaii. They were randomly 
assigned to four groups of 25 each: two ex- 
perimental groups, which both received rein- 
forcement, one group with the color-naming 
task between trials, the other without; and 
two control groups, neither of which was 
reinforced, one with the color-naming task 
between trials, the other without. 

Apparatus—The stimulus materials for 
the conditioning task consisted of 3 X 5 in. 
white index cards on each of which a different 
neutral past tense verb was typed. These 
verbs were taken from the list used by Binder 
and Salop (1961). The pronouns I, WE, you, 
HE, SHE, and THEY were typed above the verb 
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with the order of appearance of the pronouns 
randomized over all cards. The S was 
presented with easily recognizable colors 
shown in the aperture of a memory drum. 
The colors were arranged in random order by 
selection from a large assortment of colors. 

A screen with its leaves folded in at ap- 
proximately 120° toward Swasused to separate 
S from E. The central portion of the screen 
was 26 X 30 in. and the leaves were 12 X 30 
in. A place for the presentation of the card 
was made 12 in. above the base of the screen 
and in the center of the central panel. Be- 
neath the hole through which the card was 
presented, a slot was cut in the screen to fit 
the memory drum so that the memory drum 
would face S. The colors on the memory 
drum thus appeared directly below the card 
holder. 

The E used a scoring sheet on which the 
numbers 1-120 were followed by the pronouns 
used in the experiment, and he encircled the 
single pronoun used by S on each trial on that 
data sheet. 

Procedure.—Although the results of the 
testing were not used in the present study, 
all Ss were first administered the MMPI in 
one room before being brought to the experi- 
mental room. In the experimental room a 
short informal preconditioning interview was 
held with E in order to establish a friendly 
relationship between E and S. After this 
informal interview, the usual instructions for 
the Taffel (1955) procedure were read to S, 
The Ss in the color-naming group were in- 
structed to name the colors appearing on the 
memory drum between trials. 

The Ss then proceeded to construct sen- 
tences from the 120 cards on which the 
pronouns and the verb had been typed. For 
the reinforced groups, RC (reinforced control) 
and RN (reinforced naming), during the first 
20 trials every third sentence constructed by 
S was followed by E murmuring “mm-hmm” 
regardless of which pronoun was used. This 
was done so that the first reinforcement after 
the first block of 20 trials, the Operant level 
trials, would not be such a surprise to § 
(Matarazzo, Saslow, & Pareis, 1960). After 
the first 20 trials, each time S began a sentence 
with either "I" or "we" E said “good” at the 
end of the sentence for both reinforced groups, 
RC and RN. 

For the nonreinforced groups UC (un- 
reinforced control), and UN (unreinforced 
naming), E remained quiet throughout the 
[20 trials. 

The color-naming task performed by the 
olor-naming groups, RN and UN, consisted 
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of naming colors during the 10-sec. intertrial 
period. The colors appeared in the memory 
drum every 1.43 sec. The non-color-naming 
groups, RC and UC, did not perform any task 
during the 10-sec. intertrial interval. 

Following the presentation of the 120 cards 
all Ss received the postconditioning interview, 
which was a series of questions read to S by 
E one at a time, with the answers to the 
questions being written by S on a blank sheet 
of paper. The interview used was patterned 
after that used by Dulany (1962). 

Two graduate students in psychology were 
used as judges to rate the interviews obtained 
from Ss. The judges rated the interviews of 
the Ss for BI, RH, and BH. The criteria used 
were those given by Dulany (1962). 


RESULTS 


The 120 trials were divided into six 
blocks of 20 trials each. The first 
block of trials was considered the 
Operant level for each S. Number of 
"I-we" sentences was taken as the 
dependent variable. An analysis of 
variance of operant levels indicated 
that the color-naming variable was 
significant, F (1, 96) = 7.62, p < .01, 
with a higher operant level for the 
non-color-naming groups, but that 
the reinforcement variable, F (1, 96) 
= 3.89, p > .05 and the Reinforce- 
ment X Color-Naming interaction, 
F (1,96) — 0.00, b> .05 were both 
nonsignificant, 

An analysis of covariance was made 
using operant level as the control 
variable and involving the scores of 
all four groups on three variables: 

; reinforcement; B, intertrial activ- 
ity; and C, blocks of trials. The 
results of the analysis show that there 
was a significant overall reinforcement 
effect (A), F (1,95) = 12.57, p < .001, 
1e., the reinforced and nonreinforced 
Ss differed Significantly in the total 
number of ” J-we” sentences produced. 

he intertrial activity variable (B) 
Was not significant, F (1,95) = 0.06, 
b > .05, nor was the interaction of 
intertrial activity and reinforcement 
(A X B), F (1,95) = 1.30, p > .05. 
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BLOCKS OF TRIALS 


Fic. 1. 


Means for the four groups on the five blocks of trials 


adjusted for operant rates. 


Both blocks of trials (C), F (4, 380) 
— 4.37, p « .01, and the interaction 
of blocks of trials with reinforcement 
(A X C), F (4, 380) = 8.59, p < .001, 
Were significant. Neither the inter- 
action of intertrial activity and blocks 
of trials (B X C), F (4, 380) = 1.08, 
P > .05, nor the three-way interaction 
of reinforcement, intertrial activity, 
and blocks of trials (A X B X C), 
F (4, 380) = 0.95, p > .05, was sig- 
Nificant. Figure 1 presents the means 
for the four groups on the five blocks 
Of trials adjusted for operant rates. 

The degree of interjudge agreement 
for 100 interviews was 97% for BH, 
97% for RH, and 85% for BI. The 
Sum of the ratings for the two judges 
9n each measure was used to correlate 
With conditioning score for S. 

Table 1 presents correlations be- 
tween a conditioning score obtained 
from the analysis of covariance, con- 
Sisting of the individual's adjusted 
Score on the last block of trials (ad- 
Justed for operant level), with levels 
of BH, RH, and BI for the two rein- 
forced groups, RN and RC. The 
Significant correlation between con- 
ditioning and RH in Group RC 


indicates that an increase in "aware- 
ness" was significantly related to an 
increase in conditioning in Group RC. 
(Smaller values for RH indicate 
greater degrees of "awareness.") The 
nonsignificant correlation between 
conditioning and level of RH in Group 
RN indicates that the degree of 
awareness was not significantly related 
to conditioning in the color-naming 
group. The difference between the 
correlations for Groups RN and RC 
is significant, Z = 2.83, p < .01. 
None of the other correlations were 
significant. 

Chi square was used to compare the 
two reinforced groups, RC and RN, 
with respect to levels of RH. The 
value of chi square obtained (using a 


TABLE 1 


CORRELATION COEFFICIENTS OF LEVELS OF 
BH, RH, AND BI wit CONDITIONING 
Scores ror Groups RN AND RC 


Group BH RH BI 
RN —.13 .06 —.06 
RC —.25 —.66** —.30 

**5 «.01. 
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TABLE 2 


RH VALUES (SUMMED SCORES or Two 
JUDGES) FoR Groups RN anp RC 


RH Level 
Group 
2 3 4 5 6 7 8 
RN 3 1 8 11 2 
RC 2 7 |2 |14 


collapsed contingency table), X? (2) 
= .16,p > .05, indicated that Groups 
RC and RN did not differ with respect 
to the distribution of levels of RH. 
Table 2 presents the distribution of 
frequencies in the RH categories for 
the two reinforced groups. 


Discussion 
As indicated above, the Dulany- 
Spielberger position holds that the 


"awareness" on the part of S of the con- 
tingency between his verbal response and 
E's reinforcer is a necessary condition 
for VOC. Dulany’s (1962) postulates 
would predict significant correlations 
between BI, BH, RH, and conditioning. 
They would also predict that the magni- 
tude of the correlations of these variables 
with conditioning scores would be of the 
order BI > BH > RH. From the Du- 
lany (1962) theory, we would also 
predict that if the color-naming proce- 
dure interferes with the hypothesis- 
formulating behavior of S in any way, 
either through BI, BH, or RH, it should 
also interfere to a comparable degree 
with the conditioning score of S. 

In checking these predictions from 
the Dulany-Spielberger interpretation 
against the results of the present experi- 
ment, we find that certain of them are 
not confirmed. In the first place, the 
color-naming task required between trials 
for Group RN did not adversely affect the 
conditioning of that group as would be 
expected from the theory. The analysis 
of covariance and Fig. 1 indicate that 
although the overall conditioniag results 
show that there was learning in both 
reinforced groups (RN and RC), the 
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nonsignificant A X B X C interaction 
indicates that the reinforced groups did 
not differ with respect to the amount of 
learning. This can be seen clearly from 
the graph in Fig. 1. However, the chi- 
square analysis showed that the dis- 
tribution of RH ratings did not differ for 
Groups RN and RC. Thus, the two 
reinforced groups had the same distribu- 
tion of RH ratings, but the correlation 
analysis (Table 2) indicates that the 
degree of "awareness" of the individual 
Ss in Group RC as indicated by RH 
ratings was significantly correlated with 
the degree of conditioning, while there 
was no such relationship found for Ss in 
Group RN. The degree of relationship 
between RH ratings and conditioning 
did differ significantly for Groups RN 
and RC. These findings suggest that 
the nature of the experimental task, 
together with the interview afterward, 
results in a certain distribution of degree 
of "awareness" among Ss in a reinforced 
group, but that the degree and direction 
of the relationship between the level of 
"awareness" of individual Ss and their 
degree of conditioning is dependent upon 
the degree to which Ss are able to 
formulate hypotheses during the acquisi- 
tion phase of the experiment. These 
findings would not be expected on the 
basis of the Dulany-Spielberger analysis 
of the VOC situation. 

The further prediction from the Du- 
lany theory that the magnitude of the 
correlation of the awareness variables 
with conditioning scores would be of the 
order BI > BH > RH was also not con- 
firmed with our data. For Group RN 
the order was BH > BI > RH, and for 
Group RC the order was RH > BI > BH. 

Although our results are contrary to 
what would be expected on the basis of 
the Dulany-Spielberger interpretation 
of VOC, they do seem to be consistent 
with what would be expected from the 
Verplanck-Krasner position. The Ver- 
planck-Krasner position holds that there 
is no necessary relationship between 
degree of conditioning and the level of 
"awareness" of Ss of a contingency be- 
tween S's response and E's reinforcer. 
This interpretation holds that the action 
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of the reinforcer is direct, i.e. not 
mediated by cognitive processes, and 
that there may or may not be a relation- 
ship between the performance of S and 
his verbalization about his performance, 
i.e., his "awareness" of it, depending on 
the particular circumstances of the ex- 
periment. This is essentially the posi- 
tion of Skinner (1953, 1957). It would 
seem from our results that Verplanck 
(1962) and Krasner (1962) are correct 
in assuming that there is no necessary 
relationship between VOC scores and 
degree of awareness. 

We would conclude, then, that al- 
though in the typical VOC experiment a 
positive correlation is found between 
level of "awareness" and amount of 
conditioning, this is due to the oppor- 
tunity Ss have during the acquisition 
phase of the experiment to formulate and 
test hypotheses regarding the relation- 
ship between E's reinforcer and S's be- 
havior, but that there is no necessary 
relationship between “awareness” and 
such learning. When conditions are such 
that S is not able to formulate hy- 
potheses during acquisition, the same 
degree of learning may be exhibited as 
when S does have the opportunity to 
formulate hypotheses, indicating a direct 
strengthening effect of the reinforcer, 
which is not correlated with the degree 
of S's awareness. However, when such 
Ss are later interviewed, as a result of the 
Nature of the experimental task and the 
interview, a certain distribution of 

awareness” levels will be exhibited by 
Ss. But this "awareness" on the part of 
individual Ss, which may indeed be 
Produced by the interview itself, need 
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not be related to the degree of condition- 
ing exhibited by the particular Ss during 
the experiment. 
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THE PARTIAL-REINFORCEMENT EFFECT SUSTAINED 


THROUGH BLOCKS OF CONTINUOUS 
REINFORCEMENT IN CLASSICAL 
EYELID CONDITIONING ! 


SALLY L. PERRY? Asp JOHN W. MOORE 


University of Massachusetts 


'The partial-reinforcement effect (PRE) was demonstrated in human 
eyelid conditioning by 2 groups of 24 Ss each: one group received 80 
acquisition trials under 50% random reinforcement. The second group 
received 100% reinforcement. Percentage CRs over 20 extinction 
trials was highest for the first group. A third group was shifted from 
50% to 100% reinforcement after Trial 40 and then extinguished after 
Trial 80. Covariance adjustments of the extinction performance of 
this group for acquisition performance and for intertrial blink rate 
indicated that the PRE was sustained through the block of continuous 
reinforcement. A modification of the Humphreys-expectancy version 


of the discrimination hypothesis could predict this result. 


Discussion of the results of some 
recent experiments (e.g., Spence, Rut- 
ledge, & Talbott, 1963) involving 
extinction of the conditioned eyelid 
response of human Ss has favored a 
Humphreys-expectancy version of the 
discrimination hypothesis (Lawrence 
& Festinger, 1962, p. 14) to account 
for the partial-reinforcement effect 
(PRE). Any application of the dis- 
crimination hypothesis must be ques- 
tioned in light of experiments in 
instrumental conditioning, with both 
human and rat Ss, which have demon- 
strated that the degree of shift in 
reinforcement schedule from acquisi- 
tion to extinction has little effect upon 
the PRE (e.g., Capaldi & Capaldi, 
1963; Jenkins, 1962), and, in the ex- 
treme ease, that the PRE is sustained 
through blocks of continuous rein- 
forcement (Theios, 1962). The pres- 


1 This investigation was supported in part 
by Public Health Service Grant MH-7630-01 
from the National Institute of Mental Health 
and was carried out by the first author in 
partial fulfillment of the requirements for the 
MS degree. We wish to thank L. E. Price 
for providing some of the apparatus. 

2 Now at Northeastern University. 


ent experiment was designed to deter- 
mine whether the prediction of the 
discrimination hypothesis that the 
PRE should not be sustained through 
blocks of continuous reinforcement 


is at least true for human eyelid 
conditioning.* 


METHOD 


The details of the apparatus have been 
described elsewhere (Mattson & Moore, 
1964). Eighty-one student volunteers served 
as Ss in three groups. Group 50-50 received 
50% partial reinforcement for 80 acquisition 
trials. Group 50-100 received 50% partial 
reinforcement during the first 40 acquisition 
trials and was then shifted to 100% reinforce- 
ment for another 40 trials. Group 100-100 
received 100% reinforcement for 80 acquisi- 
tion trials. All groups received 20 extinction 
trials with no UCS presentations. Half of the 
Ss in each of these groups were males and half 
were females. Nine Ss (3 from each group) 
were discarded for failure to give at least three 
CRs in the last 10 acquisition trials. Of the 
remaining 72 Ss, 12 were assigned to each of 
the six cells ina 3 x 2 randomized design in 
which schedule of reinforcement (50-50, 50- 
100, and 100-100) was orthogonal to S's sex. 


3 The arguments involved in the derivation 
of this prediction have been outlined by 
Theios (1962). 
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only to the first trial.) 


"n he Ss were given "neutral" instructions 
trial an intercom immediately before the first. 
SPL The CS was an 800-cps tone of 70 db. 
E eee of 550 msec. duration. The UCS 
On En 0 Ib/sq in air puff of 50 msec. duration. 
5 bur ng trials the CS-UCS interval was 
cada The 50% reinforcement schedule was 
fir wee so that (a) no more than 4 rein- 
xc or nonreinforced trials occurred con- 
cort ively and. (b) each block of 10 trials 
ES 5 reinforced and 5 nonreinforced 
ans The intertrial intervals were 15, 22.5, 
tri 0 sec. randomly distributed over the 100 

ials of the experiment. 
E A CR was defined as any deflection of the 
qeording pen of at least 1 mm. from the base 
ine and occurring in the range of 150 to 500 
msec, after CS onset. Intertrial responses 
Were recorded in order to increase the preci- 
Sion of statistical analysis of conditioning 
Scores (Mattson Moore, 1964). The 
criterion for an intertrial response was any 
deflection and return of the recording pen of 
BE. least 1 mm, The blink rate for each inter- 
trial interval was obtained by summing the 
number of blinks in the interval and dividing 
E the number of successive 500-msec. seg- 
nents within that interval. If no more than 
9ne response is counted in a 500-msec. seg 


hedule of reinforcement as the parameter. 


function of 10-trial blocks in acquisition and 5-trial 


(The first point refers 


ment, this procedure yields a percentage score 
which can be placed on the same scale of 
measurement as percentage CRs. 


RESULTS AND DISCUSSION 


Figure 1 presents the percentage of 
CRs for the three groups as a function 
of trial block. Groups 50-50, 50-100, 
and 100-100 gave 64, 68, an 
CRs, respectively, on the last block 
of acquisition trials, but the order of 
responding in extinction for the three 
groups, from highest to lowest, was a 
complete inversion of this order with 
Group 50-100 approximately midway 
between Groups 50-50 and 100-100. 


4 Results of experiments by Gormezano 
and Moore (1962) and Moore and Gormezano 
(1963), which are based on procedures com- 
parable to those in the present study, suggest 
that most “yoluntary” responders would be 
among those Ss who gave more than 80% 
CRs in acquisition. The number of Ss in 
Groups 50-50, 50-100, and 100-100 who gave 
more than 80% CRs in acquisition was one 
two, and seven, respectively. i 
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For all three groups, intertrial blink 
rate (not shown) remained essentially 
constant at a value of 20% in acquisi- 
tion and 15% in extinction. 

For statistical analyses an arcsine 
transformation was applied to each 
of the following percentage scores for 
each S: (a) CRs based on 80 acquisi- 
tion trials, (b) CRs based on the last 
40 acquisition trials, (c) CRs in ex- 
tinction, and (d) intertrial blink rate 
in extinction. For ease of presenta- 
tion, these five measures are hence- 
forth referred to as (a) acquisition 
score, (b) asymptotic acquisition score 
(c) extinction score, and (d) extinction 
blink rate, respectively. 

An analysis of variance of extinction 
scores yielded a significant main effect 
due to the schedule of reinforcement 
in acquisition, F (2,66) = 3.91, p < .05. 
This main effect was even more signifi- 
cant, F (2,65) = 11.37, p < .001 when 
extinction scores were adjusted for 
acquisition scores by an analysis 
of covariance. With an additional 
covariate, extinction blink rate, sched- 
ule of reinforcement was still highly 
significant, F (2,63) = 12.97, p < .001. 
Since the asymptotic acquisition scores 
might be considered the more appro- 
priate covariate for extinction, this 
measure and extinction blink rate 
were used in an analysis of covariance 
of extinction scores which again in- 
dicated a significant effect due to 
schedule of reinforcement, F (2, 63) 
= 11.26, p < .001. The correlation 
between extinction scores and extinc- 
tion blink rate, averaged over the six 
cells of the design, was only .28. 

Using acquisition scores and extinc- 
tion blink rate as covariates, the 
adjusted mean extinction scores for 
the three main groups were 42.19 
(50-50), 37.35 (50-100), and 20.04 
(100-100). The difference between 

the adjusted means of Groups 50-50 
and 50-100 was not significant, but 
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Groups 50-100 and 100-100 differed 
significantly from each other, F (1, 63) 
—15.84, p < .001. Using the asymp- 
totic acquisition scores and blink rate 
in extinction as the covariates, the 
adjusted mean extinction scores were 
37.95 (50-50), 34.95 (50-100), and 
26.99 (100-100). The mean of Group 
50-100 was again closer to that of 
Group 50-50 than to that of Group 
100-100, but the difference between 
50-100 and 100-100 was not significant. 

These analyses suggest that the 
PRE was sustained through the block 
of continuous reinforcement, and, in 
light of the analyses of covariance, 
any attenuation of the PRE by con- 
tinuous reinforcement was slight. lf 
one assumes that resistance to ex- 
tinction depends not only on S’s 
awareness of the change in reinforce- 
ment schedule from acquisition to ex- 
tinction but also upon his interpreta- 
tion of the change, then an expanded 
form of the Humphreys-expectancy 
type discrimination hypothesis might 
account for the results. The Ss in 
Group 50-100 may have interpreted 
the onset of extinction as a reinstate- 
ment of partial reinforcement rather 
than as the start of extinction, and the 
extent to which such an interpretation 
was made should have increased the 
level of responding of these Ss relative 
to those in Group 100-100. The Ss 
in the latter group, having had no 
previous experience with partial rein- 
forcement, could not be expected to 
have interpreted the shift as the in- 


troduction of a partial-reinforcement 
Schedule. 
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BASES FOR PREFERENCES AMONG 
THREE-OUTCOME BETS! 


SARAH LICHTENSTEIN ? 
University of Michigan 


This experiment compared the moment functions of a bet as predictors 
of choices among bets with the subjectively expected utility (SEU) 
maximization model. Bet parameters of Expected Value (EV), Vari- 
ance (V), Skewness (Sk), and probabilities were independently varied 
in 182 3-outcome bets. Each of 12 Ss bid on each bet and played some 
bets for real money. Ss preferred high EV and low V, but had no 
preferences within Sk or probability levels. Sets of bets existed which 
were equal in all parameters; within such sets, Ss preferred bets with 


the largest least likely amount. Th 
rejected; the SEU model was not. A 
was suggested. 


Research on decision making in 
gambling settings has focused in re- 
cent years on a model called the 
subjectively expected utility (SEU) 
maximization model. This model 
States that choices among bets can be 
predicted from the maximization of 
the SEU function, SEU = Vatt + Poto 
Tec aun, where there are n pos- 
sible outcomes of a bet, the first out- 
come has utility vı and subjective 
probability yı, and so on. Research 
on the model has concentrated on 
scaling the utility and subjective 
probability curves for individuals or 
groups of Ss. For reviews of this 
literature see Edwards (1954a, 1961). 

Criticism of the SEU model has 
come from Coombs and Pruitt (1960), 
who feel that the SEU model is un- 
desirable because of its relative in- 
vulnerability to experimental refuta- 


1 This paper is based on a PhD dissertation 
submitted to the Department of Psychology 
of the University of Michigan. The author is 
greatly indebted to Ward Edwards for his 
encouragement and guidance throughout the 
investigation. This research was supported 
in part by the Air Force Office of Scientific 
Research, Grant No. AFOSR 192-63. ] 

?Now at Southern Illinois University, 
Carbondale, Illinois. 
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e moment-function approach was 
lexicographic ordering of variables 


tion, in that utility and subjective 
probability curves may be found to fit 
almost any set of data. Accordingly, 
they argue against SEU model not by 
collecting evidence which refutes it, 
but by proposing an alternative set of 
variables which might be used as the 
basis of a new model, and which they 
believe is more vulnerable to experi- 
mental evidence. 

Coombs and Pruitt point out that 
the value or Expected Value (EV) of 
a bet is the first of a series of moment 
functions which uniquely describe the 
bet, when the bet is viewed asa proba- 
bility-density distribution. Other mo- 
ment functions are Variance (V), 
Skewness (Sk), and Kurtosis (K). 
EV has long been recognized as an 
important determinant of choices 
among bets, Many authors (e.g. 
Allais, 1953; Edwards, 1954b; Roy- 
den, Suppes, & Walsh, 1959) have 
Proposed that V is also important. 
For two-outcome bets, Sk is a simple 
function of the probabilities, so that 
Probability preferences might be re- 
labeled Sk preferences. Coombs and 
Pruitt proposed the moment functions 
as a possible set of Parameters useful 
in predicting choices among bets. 


PREFERENCES AMONG 


: The SEU model states that a func- 
tion of two subjective distributions, 
utility and subjective probability, 
determines choices among bets. In 
contrast, Coombs and Pruitt suggest 
that the entire shape of the objective 
distribution determines choices. The 
moment functions are a set of param- 
eters summarizing that shape. Either 
of these general models can encompass 
so much data that experimental ref- 
utation is difficult. But both these 
approaches are vulnerable to experi- 
mental refutation when details of the 
model are specified, as for example 
Davidson, Suppes, and Siegel (1957) 
have done for the SEU model, or when 
a small set of moment functions, EV, 
V, and Sk, limits the objective- 
distribution approach. 

A decision between the moment 
functions and the SEU model, how- 
ever, is hampered by a lack of sys- 
tematic, empirical information about 
the effects of a number of bet variables 
upon Ss’ choices. This experiment 
Was designed to supply this lack by 
exploring the relevance and relative 
importance of the following bet pa- 
rameters: the first four moment 
functions, EV, V, Sk, and K, as de- 


TABLE 1 
FORMULAE ron MOMENT FUNCTIONS 


Moment Definitional 
Notation Formula 
Expected Value m z pi$i 
Variance m. | Edisa 
i 
Z piSi— m)? 
Skew Bae i 
ness [uae] [ese] 
Z p: ($: — mu) 
Kurtosis Hie í 
[ezet [ezet 


„Note.—p; is the probability of the ith outcome; 


$i is the amount of money to be won or lost 
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fined in Table 1, and 
probabilities of a bet. 

In order to evaluate properly the 
effectiveness of these bet parameters, 
it is important that they be mutually 
independent in the experimental de- 
sign. This has not been true in 
previous research. Edwards (1953), 
in exhibiting probability preferences, 
confounded probability, V and Sk. 
Coombs and Pruitt (1960) confounded 
probability with Sk. Such confound- 
ing is inevitable when bets having only 
two outcomes are used, since the 
equation for Sk then reduces to a 
simple function of the probabilities 


(s: = m) 
Npips 


This study used three-outcome bets, 
permitting independent variation of 
probabilities, EV, V, and Sk, with K 
confounded. The amounts of money, 
$;, needed to define a bet, given the 
values p, EV, V, and Sk, may be 
found by solving the equations in 
Table 1 by approximation techniques.* 
In some cases there are multiple 
solutions of $;, so that several different 
bets exist which are equal in their 
probabilities, EV, V, and Sk, but 
which differ in wins and losses. For 
example, the bet, “1/8 chance to win 
$1.82; 2/8 chance to lose $2.49; 5/8 
chance to win $.63,” and the bet, ‘1/8 
chance to lose $3.00; 2/8 chance to 
lose $1.23; 5/8 chance to win $1.08” 
both have EV =0, V = 2.25, and 
Sk = —.9. When probabilities in 
units of eighths are used, no bets exist 
when Sk is outside the range +2.26. 
Within this range, from zero to four 
bets exist for each choice of proba- 
bilities, EV, V, and Sk. 


the set of 


3 The author is indebted to L. J. Savage for 
suggesting the graphic met hod of solution 
used to obtain the bets. 
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The existence of several bets all 
having the same probabilities, EV, V, 
and Sk is a powerful feature of the 
experimental design. It permits a 
direct assessment of the value of the 
moment functions as a useful set of 
predictive variables. If the variation 
of S’s responses within such a set is as 
large as the variation between such 
sets, the moment functions will be 
judged inadequate, unless K, the next 
such function, is related to the varia- 
tions in response. 

To measure Ss’ preferences, a 
single stimulus, real gambling bidding 
method was employed, both because 
the more usual paired-comparison 
techniques are impractical when large 
numbers of bets are used, and as a 
means of increasing Ss’ motivation. 
Previous research with this bidding 
method (Slovic, Lichtenstein, & Ed. 
wards, in press) indicated that it pro- 
vides powerful ratio scale data which, 
when reduced to order relationships, 
are similar to data obtained by paired 
comparisons. Its greater economy 
over paired-comparison methods and 
its attractiveness to Ss further argue 
for its use. 


METHOD 
Stimulus Material 


The stimuli used were 182 three-outcome 
bets and 7 two-outcome bets, Each bet was 
presented five times. The three-outcome bets 
were computed from combinations of the 
following levels of the bet Parameters. The 
number of different bets at each level is shown 
in parentheses. 

Probability.—All five possible triples based 
on the unit of an eighth were used: 1/8, 1/8, 
6/8 (16 bets); 1/8, 2/8, 5/8 (96); 1/8, 3/8, 
4/8 (28); 2/8, 2/8, 4/8 (18); and 2/8, 3/8, 

24). 
Bw levels: —83¢ (39 bets), zero 
and 4-83£ (39). 
Mr on o levels: a (107 bets), 1.00 (25), 
5), and 4.00 (25). 
gin o levels: —1.8 (16 bets), —.9 (42), 
zero (66), +.9 (42), and +1.8 (16). 
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K.—The bets also varied in K, ranging 
from 1.3 to 5.2, K remains constant with 
changes in EV and V, and with changes in the 
sign of Sk. 

Number of bets.—When probability and Sk 
are varied, while EV and V are held constant, 
41 different bets may be calculated. A linear 
transformation on any bet produces a new bet 
with the same probabilities and Sk, but new 
EV and V. Thus when EV and V are varied, 
à total design of 492 bets (41 X3EV Xx 4 V) 
is produced. The 182 three-outcome bets 
used in the experiment comprise a subset of 
the possible 492 bets, This subset was chosen 
so that only second- and third-order inter- 
action effects would be lost to the analysis. 
The bets used are listed in Lichtenstein (1962). 

The seven two-outcome bets had EV = 0 
and V = 25, with Sk values ranging from 
—2.26 to 4-2.26. 

The design of this study is incomplete, due 
both to the variable number of bets which can 
be computed for given values of the param- 
eters, and to the reduction in the number of 
bets used from 492 to 182. 


Response Method 


The bets were Presented one at a time to 
each S. The bets were shown on 4 X 6 in. 
cards as pie diagrams, with the amount of 
money corresponding to each Outcome written 
inside the sectors of the pie, e.g., “win $1.00," 
"lose $.50," or “win nothing." 

For each bet, S was asked to state the 
largest amount of money he would be willing 
to pay E in order to play the bet, 


The S was told that this amount of money, 
be compared with 


give careful responses; there was no such 
comparison group. The Æ actually accepted 
a random third of S's bids, so that S did not 
receive any consistent feedback useful to him 
in developing à strategy of play, 

When a bid was declared acceptable by E, 
the bid amount Was exchanged and the bet 
was played. When the bid was refused, the 
bet was not played. 

The bets were Played by spinning 4 
balanced aluminum Pointer placed over à 
Plastic-disc 8 in, in diameter, Five discs cor- 
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responding to the five probability combina- 
tions for three-outcome bets, and four discs 
with two-outcome probabilities, were prepared 
by painting the backs in colors red, blue, and 
green, in pie-slice sectors. 


Procedure 


Subjects.—The Ss were 12 male volunteers, 
undergraduates from the University of 
Michigan. 

Sessions.—The Ss were run singly in seven 

2-hr. sessions spread over a 2-wk. period. 
Five of the sessions were devoted to the ex- 
perimental bets; in each session Ss bid on 189 
bets, a single complete replication. The bets 
were presented in a different random order for 
each session; all Ss saw the bets in the same 
sequence. 
_ Two sessions, the fourth and last, were ad- 
Justment sessions to control the total amount 
of each S's winnings. In these sessions S 
made paired-comparison choices between 
Pairs of bets. The S was told he would play 
a random half of his choices for money. 
Actually Ss who had already won more than 
the budgeted amount played mostly negative 
EV bets, while Ss who had lost money or had 
Won less than the budgeted amount played 
mostly positive EV bets. 

Financial exchanges.—All Ss were told at 
the beginning of the experiment that they 
Would be gambling for real money, and that 
While the game was biased in their favor, 
there was a possibility that they could lose 
Some of their own money. Financial ex- 
Changes during a session were made with 
Poker chips. The Ss were advanced $30.00 
In chips at the beginning of each session and 
TeQuired to return the same amount at the end 
of the session, 


RESULTS 


Data analysis —Analysis of the data 
Was based on the amount of money 
each S bid for each replication of each 

et. A positive number means B 
Offered to pay E for the bet; a negative 
number means S required E to pay 5 

efore he would play the bet. Because 
9f the incompleteness of design, when 
Means are computed across all the bets 
Used in the study certain levels of the 
Parameters are overrepresented. In 
Order to avoid drawing false conclu- 
Sions from data in which several levels 
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of the variables are unevenly rep- 
resented, five small subsets of the 
data, each factorially complete, and 
each representing the variation of two 
parameters while two are held con- 
stant, were subjected to analyses of 
variance. Thus any conclusion drawn 
from the total data was checked in two 
or three narrower but purer settings. 

Subject consistency.—The Ss bid 
carefully and with moderate consist- 
ency. The average standard deviation 
of the five bids a single S made to a 
single bet in five replications was 14¢, 
for all Ss and all bets. The most 
consistent S had a per-bet average 
standard deviation of 56; for the most 
variable S this figure was 234. 

All Ss made clear by their comments 
during the experiment that they ac- 
cepted the competitive bidding fiction 
and believed in the game; they were 
trying very hard to play the game 
wisely and to win money. When Ss 
were winning, they were happy, cheer- 
ful, and “chatty.” When Ss were 
losing they became silent, moody, and 
often rude, picking arguments over 
trivia. 

Expected Value.—All Ss were sensi- 
tive to the EV of the bets. The first 
column in Table 2 shows the mean 
amount bid for each EV level. Over 
the entire experiment, the average bid 
was 12¢ below EV. Only one S often 
bid too high; 68% of his bids averaged 
over replications were above EV. 
For the other 11 Ss this figure was 
16%. 

Variance.—The Ss were also sensi- 
tive to changes in V, as shown in 
Table 2. The Ss bid more for low V 
bets than for high V bets. The data 
were then divided into 36 subsets 
comprising each of the 12 Ss and each 
of the 3 EV levels. The bets within 
each subset were constant in EV but 
differed in V, Sk, and probabilities, 
Each subset was examined for V 
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TABLE 2 


MEAN Amount BID By ALL Ss ron Eaca LEVEL or EACH PARAMETER 


Expected Value Variance Skewness Probabilities 
Level Mean Bid Level Mean Bid Level Mean Bid Level Mean Bid 
— —91 25 —07¢ = LS —06¢ 1/8, 1/8, 6/8 —09¢ 
mY 1.00 ~— 12% L9 —09¢ 1/8, 2/8, 5/8 —13¢ 
83¢ 67¢ 2.25 —17¢ 0 —13¢ | 1/8, 3/8, 4/8 —09¢ 
4.00 =Z5¢ 9 =12 2/8, 2/8, 4/8 —08¢ 
L8 | ~12¢ | 278 378 3/8| —10€ 


Note.—A negative bid means S required E to pay S before S would play the bet. 


preferences. Of these subsets, 21 sets 
showed a simple ordering from low V 
to high (10 sets with zero EV, 6 with 
positive EV, and 5 with negative EV). 
Of the other 15 sets, 11 had no clear 
preference order but differed from the 
order shown in Table 2 by only a few 
cents, while only four sets had clearly 
different orders. 

Skewness.—Skewness had no notice- 
able effect on Ss' bidding. Inspection 
of the data revealed that no single S 
had a clear preference ordering among 
the five Sk levels, nor was there any 
small effect cumulative across Ss. 
Table 2 shows the mean amount bid 
at different Sk levels. 

Probability.—There was no evidence 
for probability preferences for any 
single S, nor for all Ss cumulatively, 
as shown in Table 2. 

Analyses of variance.—The five 
subsets of data subjected to analysis 
of variance (EV X V, EV x Sk, EV 
X Probability, V X Sk, and V x Prob- 
ability) confirmed the above results. 
EV and V main effects were extremely 
large, while Sk and probability showed 

no effects. The Subjects main effects 
were of moderate size. All interac- 
tions were insignificant. 

Kurtosis.—Correlations between K 
and amount bid averaged over replica- 
tions were computed for each Ss and 
for each EV level separately. These 
correlations ranged from —.32 to 


+.30, with a mean of +.02. More 
detailed analyses checking for possible 
curvilinear relationships, and separate 
analyses for each V level, indicated no 
relationship between K and amount 
bid. 

Bets with equal independent vari- 
ables.—The existence of multiple bets 
with constant probability, EV, V, and 
Sk provides an opportunity to test 
for bet preferences unrelated to the 
moment functions. These sets of bets 
were examined to see if differences in 
amount bid among the bets were large 
or interpretable. The design includes 
12 such sets of four bets each, 6 sets of 
three bets each, and 46 sets of two 
bets each. 

Consistent ordering of these bets 
within sets was shown; Ss consistently 
preferred bets in which the amount of 
money associated with the least likely 
outcome was large and positive. For 
example, Ss bid 204 more, on the 
average, for the bet, “1/8 to win 
$1.83; 2/8 to lose $2.49; 5/8 to win 
$.63,” than for the bet, “1/8 to lose 
$3.00; 2/8 to lose $1.23; 5/8 to win 
$1.08." Both these bets have EV = 0, 
V = 2.25, and Sk — .9. 

Tab 


able 3 summarizes these differ- 
ences among bets of equa] independent 
variables, under the hypothesis that 
Ss will prefer bets in Which the least 
likely outcome h 


as the largest win. 
For sets of three 


and four bets, the 
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differences shown in Table 3 were 
taken between the bet with the largest 
win and the bet with the largest loss in 
the least likely outcome. The inter- 
mediate bets are not included in this 
summary, but were highly consistent 
with the reported results. 

, Among the 46 pairs of bets with equal 
independent variables, 18 sharply dif- 
fer in least likely amount. Table 3 
shows the 13¢ mean difference in 
amount bid between these bets. For 
the 28 pairs which are equal or nearly 
equal in least likely amount, the pre- 
diction was made that Ss would base 
their preferences on the amount of the 
next least likely outcome (i.e., the out- 
come associated with the middle-sized 
probability). "Table 3 shows that the 
mean difference in amount bid was 
small (44) for this group. 

T'hese consistent preferences among 
bets of equal EV, V, Sk, and proba- 
bility were not related to the K values 
of the bets. 

Effects of real gambling.—Detailed 
analyses of individual S's data showed 
that Ss' monetary success or failure 
had little or no effects on Ss’ bids. 
Gambling variables which failed to 
correlate with amount bid were: the 
total amount of money S had previ- 
ously won or lost when he made a 
given bid, the amount of money won 
or lost within a single session, the 
amount of money won or lost on the 
last three gambles, and the number of 
Successive bids refused by Æ. 

Two-outcome bets.—Data from the 
Seven two-outcome bets were exam- 
ined for evidence of preferences among 
the bets, but little preference was 
found. Only six Ss showed a differ- 
ence in amount bid of 15¢ or more be- 
tween the most and least preferred 
bets. These six Ss had no common 
Pattern in their orderings of the seven 
bets, 
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TABLE 3 


DIFFERENCES IN MEAN Amount Bip 
WITHIN SETS OF S WITH 
EQuAL PARAMETERS 


re me with 
o. |Difference Mean 
f % i 
sielet] “|RSS 
Direction 

Quadruples of 
, bets 12 12 (10005) | 18€ 
Triples of bets} 6 6 (100%) | 22¢ 
Pairs of bets 

differing in 

least likely 

amount 18 15 (83%) | 13¢ 
Pairs of bets á P 

same in 

least likely 

amount 28 21 (7590)| 46 


Subject differences.—Vinally, a glo- 
bal view of each S's data for the entire 
experiment was taken to see if there 
were any striking differences among 
Ss. Two Ss were notable. Subject 2 
often bid above EV for bets, and thus 
lost money in the bidding sessions. 
His unprofitable play, however, was 
not highly erratic; his mean standard 
deviation across replications was be- 
low the group average. He was the 
only S who preferred high V bets. 
Subject 6 carefully attended to the 
EV of bets and fairly consistently bid 
3¢ below EV. His data are striking 
for their evenness, showing no effects 
attributable to any parameter other 
than EV. The other 10 Ss, although 
varying slightly in different aspects of 
their performance, seem well repre- 
sented by the group data. 


DISCUSSION 


Moment functions.—This study was 
based on the suggestion that the set of 
moment functions, EV, V, Sk, and K, 
might provide the basis for a decision- 
making model, as an alternative to the 
SEU model. The results must be 
evaluated in the light of this suggestion. 
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First, there was a sizable effect attribu- 
table to EV, as expected. Indeed, it is so 
certain that Ss attend to EV, that failure 
to find a substantial EV effect would 
invalidate the study. Variance also had 
an effect on Ss’ behavior. Ten of the 12 
Ss showed a preference for low V bets. 
In a sense, this result replicates the 
results of Coombs and Pruitt (1960), who 
also found V preferences. But while 
Coombs and Pruitt’s Ss showed wide 
variation in their most preferred V level, 
most Ss in the present study preferred 
low V bets. This difference may be due 
to the difference in motivating conditions 
in the two studies. In the Coombs and 
Pruitt study, Ss were paid a salary and 
did not play any bets during the experi- 
ment. In contrast, Ss in this study 
played one-third of the bets for real 
money, and believed that their responses 
were the major determinant of how much 
money they would win or lose. The fears 
of losing money under such conditions 
may account for the avoidance of the 
riskier, high V bets. 

In the present study, neither the 
third moment, Sk, nor the fourth mo- 
ment, K was related to Ss’ behavior. An 
even more serious obstacle to the mo- 
ment-function approach was the finding 
of large and consistent differences in 
bidding among bets whose first three 
moment functions are equal. These find- 
ings lead to the rejection of the moment 
functions as a set of variables sufficient 
to build a model. 

SEU model.—None of the three pri- 
mary results, the effect of V, the effect of 
the least likely outcome, and the lack of 
probability preferences, contradict the 
SEU model. The preference for low V 
bets indicates that the utility curve for 
money is not symmetric in its extreme 
ranges; that is, that large losses appear 
larger than large wins. The effect of the 
least likely outcome indicates that Ss 
overestimate small probabilities, so that 
the subjective probability curve is not 
linear with objective probability. A non- 
linear subjective probability function 
applied to two-outcome bets suggests the 
existence of probability preferences, while 


the complex, nonmonotonic nature of 
three-outcome bets obscures probability 
preferences. 

Probability preferences.—In this study, 
Ss showed no probability preferences. 
Edwards (1953), using two-outcome bets, 
found probability preferences, but his 
bets confounded probability with V. 
Coombs and Pruitt (1960) also found 
probability preferences but they were 
weak and unsystematic relative to V 
preferences. 

Least likely amount.—The unexpected 
finding of consistent preferences among 
bets of equal EV, V, Sk, and probabilities 
is of interest. Apparently Ss lay undue 
emphasis on the amount of money asso- 
ciated with the smallest probability, 
bidding more for a bet whén this amount 
is positive than when it is negative. This 
effect of the least likely amount is quite 
distinct from the effect of V. To illus- 
trate this distinction, consider the follow- 
ing bets: 


A. 1/8 to win $.12; 2/8 to win $8.83; 
5/8 to lose 8.35. 

B. 1/8 to win $1.00; 2/8 to win $.41; 
5/8 to lose 8.36. 

These two bets have equal EV, V, and 

Sk. Within this pair, Ss bid more for 

Bet B, the bet with the largest least 

likely amount. In contrast are the 

following bets: 

A. 1/8 to win $.12; 2/8 to win $.83; 
5/8 to lose 8.35. 

C. 1/8 to win $8.50; 2/8 to win $3.50; 
5/8 to lose $1.42. 


These two bets have equal EV and Sk 
but differ in V. Here, Ss bid more for 
Bet A, the bet with the smaller V, even 
though Bet C has the larger least likely 
amount. 

These findings suggest that Ss ap- 
proach bets with a lexicographic ordering 
of relevant variables: EV is of primary 
importance. When deciding between 
bets of equal EV, Ss preferlow V. When 
both EV and V are constant, Ss attend 


to the least likely amount, avoiding a 
small chance of large loss. 
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APPARENT SPATIAL POSITION AND THE 
PERCEPTION OF LIGHTNESS ! 


JACOB BECK 


Harvard University 


2 studies investigated the relation of lightness perception to the per- 
ception of spatial position. The results confirm earlier findings that 
lightness perception may be affected by how an O perceives the surface 
to be oriented with respect to the illumination. The results fail to 
support the hypothesis, however, that the apparent position of a surface 
relative to the illumination is used as a basis for computing the albedo 
of a surface. Rather, the general hypothesis the studies appear to 
support is that processes of perceptual organization come into play 
result of the cue properties of stimuli which affect whether a va 


ariation 
in luminance will be seen as a difference in the illumination of the surface 


asa 


or as a difference in the lightness of the surface. 
reduced surface luminance seen in one position as 
other, seen as a gray surface color, in each case 


apparent position of the surface. 


Studies have shown surface light- 
ness to change with changes in the 
apparent depth (Woodworth & Schlos- 
berg, 1954, p. 444), slant (Katona, 
1935; Mach, 1959, p. 209), or tilt 
(Hochberg & Beck, 1954) of a surface. 
Evidently, a description of lightness 
perception must take into account the 
apparent spatial position of a surface 
as well as the light reflected by the 
surface. The effects of apparent 
spatial position imply that not only 
the luminance relationships, but the 
cue properties of the stimulus influ- 
ence lightness perception. The con- 
cern of the present experiments is how 
cues for spatial position affect the 
perception of lightness. 

One hypothesis is to suppose that 
the apparent position of a surface 
enters into computing the albedo of a 
surface. The perception of lightness 
is assumed to be the result of an un- 


1 This research was supported in part by 
funds granted by Public Health Service 
Research Grant No. M-05120-02 to Harvard 
University, Center for Cognitive Studies and 
in part by National Science Foundation 
Grant GB-94 to Harvard University, Psy- 
chological Laboratories. 


Thus, an area of 
a shadow is, in an- 
consistent with the 


conscious computing in which the 
intensity of the reflected light is 
evaluated with respect to the intensity 
of the registered illumination (Wood- 
worth & Schlosberg, 1954, pp. 430- 
431). Change in the apparent posi- 
tion of a surface affects perceived 
lightness because a surface in the new 
position would have to have a lower 
(or higher) albedo to reflect the same 
amount of light to the eyes. Beck 
(1961, 1962), however, has argued 
that perceived lightness and illumina- 
tion are not mutually related in the 
manner implied by the albedo hy- 
pothesis. Moreover, lightness percep- 
tion of uniformly illuminated surfaces 
has been described without recourse 
to the impression of illumination as a 
cue (Helson, 1943; Stevens, 1961; 
Wallach, 1948). 

A second hypothesis is to attribute 
the change in lightness to processes of 
perceptual organization that come 
into play in dealing with the distribu- 
tional properties of reflected light. 
What is supposed is that operating to- 
gether with sensory processes, such as 
contrast and adaptation, there are 
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organizational processes which influ- 
ence lightness perception by the way 
a stimulus pattern? is organized into a 
systematic whole. That is, as a result 
of the cues provided by the distribu- 
tion and character of the luminance 
variations, the apparent direction of 
illumination and the apparent position 
of a surface, luminance variations may 
be perceived either as differences in 
illumination or as differences in light- 
ness. Fora coplanar surface and sur- 
round perceived to be illuminated 
uniformly, the hypothesis asserts that 
luminance variations would be per- 
ceived as differences in surface light- 
ness enabling lightness perception to 
be expressed as a function of the 
distribution of luminances in the field 
(Stevens, 1961; Wallach, 1948). For 
a surface perceived to be illuminated 
directionally, however, the organiza- 
tion of the stimulus pattern may be 
supposed to operate to maximize the 
congruences between different cues. 
Thus, an area of reduced surface 
luminance seen in one position as a 
Shadow, is, in another, seen as à gray 
Surface color, in each case consistent 
with the apparent position of the 
Surface. 

'The two hypotheses make different 
predictions concerning the effects of 
Spatial position on perceived lightness. 
The albedo hypothesis asserts that 
Derceived lightness is given by the 
equation 4 = S/M (Woodworth & 
Schlosberg, 1954, p. 431) where M 
7 the registered illumination, S = in- 
tensity of the reflected light, and 


2 Since the hypothesized organizational 
Process may take place only after a number of 
Prior perceptual functions, such as contrast 
Processes, contour processes, and so forth, the 
Phrase stimulus pattern is used as @ neutra 
term to avoid implying that the organiza- 
tional process operates directly on either the 
array of light stimulating ihe eye or the 
Proximal stimulus pattern, vid. the discussion 
Section of this report. 
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A = the perceived lightness or albedo. 
This formula implies: (a) altering the 
apparent position of a surface so as to 
modify its apparent illumination is a 
sufficient condition for a change in 
surface lightness, and (b) for a given 
retinal stimulus there is a proportional 
variation between changes of apparent 
illumination and lightness. The sec- 
ond hypothesis asserts that lightness 
perception can be analyzed as a two- 
step process. F irst, sensory processes 
act to determine a stimulus pattern in 
accordance with the distribution of 
luminances reflected, and, second, as 
a result of the cue properties of the 
stimulus, the stimulus pattern is 
organized in terms either of differences 
in surface lightness or differences in 
illumination. Implications of this hy- 
pothesis are: (a) perceived lightness 
is affected only if a change in the ap- 
parent position of a surface induces a 
new organization of the stimulus 
pattern, and not simply because of the 
altered impression of illumination, and 
(b) the range of lightness perception 
is a function of the stimulus pattern 
determined by the distribution of 
]uminances reflected and not of thc 
amount of change in the apparen' 
illumination. 

The experiments to be reportec 
provide a test of the albedo hypothesi: 
and its implications. They also serv 
to clarify the factual background o 
the relation between the perception 
of lightness and of spatial position. 


Srupy I 


Study I was designed to examir 
the stimulus conditions controllir 
lightness changes in Mach's (195 
p. 209) demonstration. 


Method 

The experiments were conducted in 
darkroom. A folded card was placed or 
table behind a cardboard partition so that 
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Fic, 1. 


left side was illuminated while its right side 
was shadowed. The table was covered with 
a black cloth. Apertures cut in the screen 
fixed to one end of the table allowed Os to see 
the target with either monocular or binocular 
vision. Changes in the apparent position of 
the target were induced by drawing rows of 
dots making an angle of 13° with the hori- 
zontal on each side of the target. When cues 
for slant were eliminated by limiting O to 
monocular vision, the presumption to see the 
dots arranged in straight lines caused the 
target to appear to stand flat directly behind 
the partition. When the target was viewed 
binocularly, the target was seen in its Correct 
spatial position and the rows of dots were 
seen as slanted. 

Four experiments were conducted. In 
Exp. I and II, the illumination (150 w.) came 
from a ceiling lamp. The illuminance of the 
illuminated side of the target was 4.2 foot- 
candles (ftc.) and of the shaded side of the 
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Illustration of the experimental arrangement in Study I. 


target 2.8 ftc. When the target was seen as 
folded, cues indicated that the left side was 
directly illuminated and the right side was 
indirectly illuminated, When the target was 
seen as flat, both sides appeared to be il- 
luminated equally. The illuminance of the 
left and right sides of a white paper of the 
same reflectance as the target and subtending 
the same visual angle when placed flat 
directly behind the partition was equal to 4.2 
ftc. Thus, changes in the apparent spatial 
Position of the target altered the apparent 
illumination of the shaded side of the target 
but not of the illuminated side, In Exp. 
III and Iv, light from a Projector located 
i ht was added to the over- 
s projector beam actually 
missed the target falling only on the card- 
the wall behind. How- 


s hen apparently flat ap- 
Deared to receive the bright light of the 


projector. The illuminance of the right and 
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TABLE 1 
MEDIANS or Os' LIGHTNESS MATCHES OF THE SHADED AND ILLUMINATED SIDE 
or THE MACH CARD WITH DIFFERENT APPARENT POSITIONS f 
Shaded Side Illuminated Side 
Exp. Exp. 
Apiay | MEET Lc we 
I 8.5 9 II 9 9.25 
ul 6.75 8 IV 8.5 8.5” 


Note.—A sign test is significant for Exp. I and III at p <.001. 


left sides of a white paper of the same reflect- 
ance as the target and subtending the same 
visual angle as the target was 165 ftc. when 
placed flat behind the partition so as to inter- 
cept the light of the projector. When seen 
in its true position, the target was seen to 
stand in the general illumination provided by 
the overhead lamp. Thus, in Exp. III and 
IV, both the illuminated and shadowed sides 
of the target were seen to be much more 
strongly illuminated when the target was 
apparently flat. 

The Os were instructed to judge the light- 
ness of the shadowed side of the target in 
Exp. I and III, and of the illuminated side of 
the target in Exp. II and IV. Each O was 
told that he would be given two looks at the 
target. His task was to note the lightness of 
the indicated side of the target and on the 
second look to match it to one of the grays on 
the Munsell scale. Each look was approxi- 
heey, 10 sec. All Os viewed the target first 

inocularly and then, after an interval of 20 
730 min., monocularly. Twelve Os served 
in Exp. I and III, and 10 Os in Exp. II and IV. 

1l Os were undergraduates who were paid to 
Participate and were naive as to the purpose 
of the experiments. 


Results and Discussion 


Table 1 presents the medians of Os’ 
Munsell matches in each of the four 
experiments. The results show that 
the shadowed side of the target 1s 
Judged darker when the target is ap- 
parently flat than when the target is 
seen in its true position. The differ- 
ence is .5 of a Munsell step in Exp. I 
and 1.25 Munsell steps in Exp. III 
(sign tests p < .001). A sign test of 
the increased lightness difference 1n 
Exp. III is significant at the .01 level. 


The Os' lightness judgments of the 
illuminated side of the target in Exp. 
II and IV, however, were not changed 
with changes in apparent spatial 
position. Regardless of apparent 
position, Os' lightness matches in Exp. 
III and IV with the added luminance 
of the projector were below those in 
Exp. I and II with only the overhead 
light (sign test $ < .001). Thus, the 
results indicate that the perceived 
lightness of the target was affected by 
both the luminances in the field and 
the apparent position of the surface. 
The question posed is: How do cues 
for spatial position influence the per- 
ception of lightness? The results fail 
to support the albedo hypothesis. 
The small changes in Exp. I and III 
indicate that perceived lightness does 
not vary directly with the apparent 
illumination. Had the target actually 
been located in the perceived position, 
the illuminance of the shadowed side 
of the target would have been in- 
creased by a factor of 1.5 in Exp. I and 
58.9 in Exp. III. The failure of a 
lightness change to occur in Exp. IV 
indicates that a change in the ap- 
parent illumination is not a sufficient 
condition for a lightness change. Had 
the target actually been located in the 
perceived position, the illuminance of 
the illuminated side would have been 
increased by a factor of 39.3. Both 
the small lightness change of the 
shadowed side and the failure of a 
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lightness change to occur on the 
illuminated side are in agreement 
with the second hypothesis proposed. 
They suggest that the apparent posi- 
tion of the target is important in 
perceiving the area of reduced lumi- 
nance either as a difference in surface 
lightness or as a difference in illumi- 
nation. When the target is seen 
folded, the right side is seen as in- 
directly illuminated, and the area of 
lower luminance is perceived as an 
area of lowered illumination or as a 
shadow. However, when the target 
is seen as flat, perception of a shadow 
is no longer consistent with the ap- 
parent uniform illumination of the 
right and left sides of the target. The 
perception therefore changes to re- 
move this discrepancy and the area 
of lower luminance is perceived as an 
area of lower surface reflectance, i.e., 
a darker surface color. Since no 
discrepancy, however, is introduced 
with the change of apparent position 
on the illuminated side, no lightness 
changes would be expected to occur. 
The results, moreover, indicate that 
the range of lightness perceptions is 
determined by the luminance dis- 
tribution reflected and not by the 
apparent illumination. If a surface is 
perceived to be illuminated uniformly, 
perceived lightness is determined by 
the relationship of luminances jn 
adjacent regions, i.e., contrast. Thus, 
as would be expected on the basis of 
contrast, Os’ lightness judgments with 
the added luminance of the projector 
in Exp. III and IV were below those 
in Exp. I and II. If a surface is 
perceived to be directionally illumi- 
nated, however, a change in perceived 
lightness may occur as a result of a 
new organization of the stimulus 
pattern. 
Effects of set.—The previous discus- 
sion suggests that whether an area of 
lower luminance will be perceived as a 
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shadow or as a darker surface color 
may depend on Os’ readiness to infer 
from the available cues that the 
reduced luminance is the consequence 
of a reduced illumination or of a re- 
duced surface reflectance. This possi- 
bility was able to be tested in terms 
of an unexpected observation. With 
sophisticated Os, the shadowed side 
of the target was judged darker in- 
dependently of whether Os viewed the 
target monocularly or binocularly 
first. With unsophisticated Os, how- 
ever, the order of observation proves 
to be crucial. If Os first see the target 
as flat (monocularly) and then folded 
(binocularly), no consistent changes 
of lightness occur. In one experiment 
of 12 Os, 5 Os judged the shaded side 
of the target to be darker when seen 
as flat, 5 Os judged it to be darker 
when seen folded, and 2 Os judged the 
lightness to be the same. In a second 
experiment of 11 Os, 3 Os judged the 
shaded side to be darker when seen 
folded, 4 judged it darker when seen 
flat, and 4 judged the lightness to be 
the same. The effect of order of 
observation indicates two important 
points. First, the results are again 
not consistent with the albedo hy- 
pothesis. Secondly, they rule out an 
explanation of the lightness changes 
as due to binocular interaction.? 
According to either of these hypoth- 
eses, the order of observation should 
make no difference. The observed 
asymmetry, however, is compatible 
with the second hypothesis proposed. 
If the target is initially seen as two 
gray shades, Seeing it in its true 
position introduces no marked in- 


. ? The hypothesis of binocular interaction 
is also ruled out by the fact that lightness 
changes will occur with changes in the ap- 
parent position of a target without changing 
from monocular to binocular observation, vid. 
Mach (1959, p. 209), Katona (1935), and 
Hochberg and Beck (1954), 
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consistency to induce a reorganization 
of the percept and no consistent 
changes of lightness would, therefore, 
be expected to occur. A lightening 
of the shaded side may be expected to 
occur with binocular observation fol- 
lowing monocular observation if a 
Preparatory set induces Os, when the 
target is seen in its true position, to 
perceive the area of lower luminance 
as an area of lower illumination. Such 
a set was established in two ways. 

In one experiment, Os viewed the 
target first, binocularly-monocularly, 
and then after 10 min., viewed the 
target monocularly-binocularly. The 
expectation was that by first seeing 
the target binocularly-monocularly 
the presumption would be strength- 
ened to see the right side of the target 
as shaded when the target was seen 
in its true position following monocu- 
lar observation. Only the overhead 
light was on and an interval of 20- 
30 min. separated each of the binocu- 
lar-monocular judgments. Thirteen 
Os Were run. Eight of the 13 Os on 
Seeing the target binocularly after 
Seeing it monocularly judged the 
ems side of the target as lighter, 
k ile 5 reported the lightness of the 
arget to remain the same. A further 
experiment indicated that consistent 
lightness changes could be obtained 
by providing Os with greater experi- 
ence than one trial. Five Os who had 
viewed the target binocularly-mo- 
Nocularly a number of times all 
judged the shaded side of the target 
as lighter when viewing the target 
binocularly following monocular ob- 
Servation. With one trial, however, 
only a short interval should elapse be- 
tween the binocular-monocular judg- 
ments, since in an initial experiment 
little or no success was obtained witha 
day interval. In a second experiment 
Set was introduced by means of in- 
Structions. The instructions were 
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designed to induce an object-directed 
attitude when viewing the target 
binocularly. When the target was 
seen in its true position, the instruc- 
tions emphasized that the target was 
a spotted card which had been folded 
in its center like a bookcover. Forty- 
five minutes separated the monocular 
and binocular judgments. Fifteen 
Os were run. Eleven Os reported the 
target as lighter when seen in its true 
position, and 4 reported the lightness 
of the target to remain the same. 


Srupy II 


The effect of apparent spatial 
position on perceived lightness was 
also investigated using the experi- 
mental arrangement of Hochberg and 
Beck (1954). They have reported 
that a fixed target under constant 
ilumination is judged darker when 
made to appear perpendicular to the 
apparent diréction of illumination 
than when made to appear parallel to 
the direction of illumination. In a 
number of preliminary experiments, 
the position of the target was found to 
determine whether Os' lightness judg- 
ments were altered with changes in 
apparent position. The first investi- 
gation was designed to examine the 
effect of target position. 


Method 


The target, a trapezoid, was cut so that 
when placed upright its retinal image would 
be the same as that of a rectangle lying on a 
table surface. The trapezoid was covered 
with a dark gray paper having a reflectance of 
approximately 9.4%. The Os viewed the 
target through apertures cut in a screen fixed 
to one end of the table. These allowed Os to 
observe the target with either monocular or 
binocular vision. The target was seen as an 
upright trapezoid when cues for slant were 
present, and as a horizontal rectangle when 
cues for slant were absent. 

The target was placed in four positions as 
shown in Fig. 2. The distances from the 
aperture to the center of the target were ap- 
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Position 1 


Position 3 
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Position 4 


Fic. 2. Photographs of the table arrangement in Study II. 
(The target is shown in each of the four positions.) 


proximately 46 in. in Position 1, 48 in. in 
Position 2, 50 in. in Position 3, and 57 in. in 
Position 4. The illumination was provided 
by an adjustable lamp attached to the table 
so that the light fell from above and behind 
Position 1. Ample cues Wére present to 
indicate the direction of illumination and its 
distribution. Changing the position of the 
target had two effects, First, it altered the 
distribution of light and Shade over the sur- 
face. In Position 1 the target was heavily 
shadowed; in Positions 2 and 3 the degree of 
shadowing was decreased; in Position 4 the 
target did not appear shadowed at all. Sec- 
ondly, the amount of change in apparent 
illumination resulting from Changes in the 
apparent position of the target varied. The 
change of the target from apparently upright 
to apparently horizontal altered the Perceived 
illumination most strongly in Position 1. The 
illuminance of the central area of the target 
was 13.5 ftc.; the illuminance of the central 
area of a gray paper of the same reflectance 
as the target placed flat on the table to 
correspond to the target projection was 140.6 
ftc. The illuminances of the target and of a 
gray paper of the same reflectance as the 


target placed to correspond to the target 
Projection on the table were 31.3 ftc. and 125 
ftc., respectively, in Position 2; 56.3 ftc. and 
93.8 ftc., respectively, in Position 3; and 62.5 
ftc. in Position 4. Thus, if the target were 
actually a rectangle flat on the table the 


Forty Os participated in the experiment ; 
10 Os judged the lightness of the target in 
each of the four Positions. The Os were naive 
regarding the Purpose of the experiment and 
Were paid to Participate, Each O made two 
judgments. He matched the lightness of the 
target to one of the grays on the Munsell 
scale when it was apparently upright and 
when it was seen to lie flat on the table. 


Results and Discussion 

The medians of Os lightness 
matches in the Íour positions are 
shown in Table 2. Though in Posi- 
tions 1, 2, and 3 the Many cues for the 
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TABLE 2 


MEDIANS or Os! LIGHTNESS MATCHES 
OF THE TARGET PERCEIVED UPRIGHT 
AND HORIZONTAL 


es Positions | Rectangle Trapezoid 
| 
1 3.5 4 
2 | 4 | 4.5 
3 | 5.25 | 5 
4 | 5.5 | 5.5 


Z | i | 


cant geen Position 1 lightness change was signifi- 


illumination would indicate the target 
to be more strongly illuminated when 
apparently flat, a significant lightness 
change occurred only in Position 1. 
The median change in Position 1 was 
> of a Munsell step. The results con- 
firmed again that a change in apparent 
illumination is not a sufficient condi- 
tion for a change in apparent lightness 
and that apparent lightness does not 
Vary as a linear function of the ap- 
Parent illumination. 
Mae results pose the question of 
YY there was a consistent change of 
'ghtness with a change of apparent 
onem in Position 1 but not in the 
esed positions. The answer to this 
cae i would appear to be tied to a 
nor € basic question. The targets did 
ri reflect light uniformly, but a dis- 
„ution of luminances. This dis- 
ribution, however, yielded a specific 
dijesPtion of surface lightness. How 
Pen this Occur? A hypothesis consis- 
th t with what has been proposed is 
boi a specific lightness perception 
“ee ts from an organization of the 
Mulus pattern in terms of the cues 
3 ed by the character and dis- 
a ton of the luminance variations. 
Ph is, due to the shape, size, edge 
s „ents, and distribution of the 
Minance variations, luminances dif- 
cs from a particular value are seen 
fe adows, highlights, and light spots 
"er than as surface lightness differ- 
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ences (Beck, 1964). Thus, parts of 
the target were perceived as evenly 
illuminated, parts as shadowed, and 
parts as highlighted. As with the 
Mach card, the change of lightness in 
Position 1 demonstrates that this 
organization of the stimulus pattern is 
not unique but can depend on the 
apparent orientation of the surface. 
In general, however, whether a change 
in apparent position induces consis- 
tent changes in surface lightness, may 
be expected to depend on the pattern 

of luminance variations over the 
surface. For example, a preliminary 

experiment suggests that whether Os 

reported the target darker or lighter 

when apparently flat in Position 2 

depends on whether they tended to 

fixate a shadowed or highlighted area 

of the target when it was seen up- 

right. When told to fixate the upper 

portion of the target (which when 

seen upright was seen as highlighted), 

8 of 10 Os judged the target when 

apparently flat to be lighter than 

when apparently upright. When told 

to fixate the lower portion of the 

target (which when seen upright was 

seen as shadowed), 7 of the same 10 

Os judged the target when apparently 

flat to be darker than when apparently 

upright. A binomial sign test of the 

lightness changes due to the change 

in fixation is significant at the .02 

level. The consistent darker judg- 

ments when the target was apparently 

flat in Position 1 may reflect the fact 

that the target in this position was 
seen to be almost entirely shadowed 

when apparently upright. Unfortu- 
nately, data are not available relating 
Os’ pattern of fixations to the lightness 
changes which occurred with changes 
in apparent position. 

Effects of contrast with background — 
To separate the effects of contrast 
from the effect of cues for spatial 
position and illumination direction, 
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TABLE 3 


MEDIANS OF Os’ LIGHTNESS MATCHES 
OF THE TARGET PERCEIVED UPRIGHT 
AND HorizONTAL WITH DIFFERENT 


BACKGROUNDS 
| Rectangle | Trapezoid 
m | ES 
Black background 4.25 | 4.75 
White background 2:5 | 4 


Note.—A Friedman two-way analysis of variance is 
significant at the .001 level. 


an experiment was conducted in which 
Os judged the lightness of a target 
with a black and a white background. 
Because of contrast, regardless of its 
apparent orientation, the target on a 
white background should be judged 
darker than on a black background. 
On the other hand, the target should 
be judged on both a black and a white 
background as darker when perceived 
perpendicular to the direction of 
illumination than when perceived 
parallel to the direction of illumina- 
tion. The experimental arrangement 
and procedure were similar to that of 
the previous experiment. The inci- 
dent illumination was set to come 
from overhead. The illuminance of 
the central area of the target was 30.2 
ftc. The illuminance of a gray paper 
of the same reflectance placed flat on 
the table to correspond to the target 
projection was 62.5 ftc. Ten Os were 
run who judged the lightness of the 
target with both a black and a white 
table cover. A balanced order was 
employed with half of the Os viewing 
the target on each background first, 
Table 3 presents the medians of Os’ 
lightness matches. The results in- 
dicate that the target was judged 
darker when apparently flat than 
when apparently upright, independent 
of whether it was on a black or a white 
background. Similarly, Os judged the 
target on a white background to be 
darker than on a black background 
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independent of the orientation of the 
target. A Friedman analysis rejects 
the hypothesis that lightness matches 
at the different apparent positions and 
backgrounds differed only by ‘chance. 
Sign tests of the differences of Os' 
lightness judgments due to the ap- 
parent position were significant at the 
-01 level for both a black and a white 
background; the differences due to the 
background were significant at the .01 
level at both apparent orientations. 
Thus, the results separate the effects 
of contrast from the effects of ap- 
Parent spatial position and illumina- 
tion direction on Perceived lightness. 


Discussion 


The results of these experiments sup- 
port earlier findings that the cue prop- 
erties of stimuli as well as the luminance 
relationships affect lightness perception. 
The data, however, are inconsistent with 
the hypothesis that surface lightness is 
the result of computing surface albedo. 
Rather, the general hypothesis the 
studies appear to support is that certain 
processes of organization come into play 
as a result of the cue properties of stimuli 
which affect whether a variation in 
luminance will be seen as a difference in 
the illumination of the surface or as a 
difference in the lightness of the surface. 
Though going beyond the scope of the 
data, a brief discussion of two questions 
relating to the hypothesized organiza- 
tional process is useful, 

The hypothesized organizational proc- 
ess is assumed to be of central origin in- 
volving the integration of differentiated 
elements into a systematic whole. A 
characterization of the organizational 
Process must, among other things, specify 
the characteristics of the stimulus pat- 


tern which is centrally integrated, Stud- 
les of light 


the central 


more peripheral 
Such as the processes 
aptation, processes of 
and surface formation 


sensory processes, 
of contrast and ad 
contour formation 
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(Wallach, 1949). What is suggested is 
that the output of these more peripheral 
Processes, i.e. a “lightness pattern," con- 
stitutes the input to the presumed 
organizational process. Thus, though 
the perception of surface lightness cannot 
be brought into a one to one correspond- 
ence with the luminance distribution, the 
luminance distributions through the 
Operation of the peripheral sensory proc- 
esses limit the range of surface lightness 
Perceptions. Moreover, through the 
determination of edge characteristics and 
mode of appearance, i.e., whether a light- 
ness gives the appearance of a film or a 
ense surface, the perception of shadow 
ped light is influenced greatly by the 
Uminance distribution. A second ques- 
tion concerns the way in which different 
oom Operate to produce an organiza- 
‘on of the stimulus pattern. The 
ma of the stimulus pattern isa 
om ex relational affair involving the 
Peta of multiple factors, such as 
Su relating to the stimulus pattern; 
el relating to the illumination and 
Sa pienk spatial position of a surface; 
actors of memory, set, and attitude 
oodworth & Schlosberg, 1954, Ch. 
tendens, well as certain organizational 
oe res (Attneave, 1954). Exactly 
a» bs ese factors cooperate in achieving 
eren dem of the stimulus pattern 
Hine SA by the peripheral sensory 
ion s is not known. The organiza- 
Would the stimulus pattern, however, 
respon appear to involve more than a 
se to the potency of individual 


4 
feni term “lightness pattern" is not in- 
i to imply that there is an awareness, 
Sensations, which corresponds to the 
— patterns resulting from the peripheral 
Chara, Y Processes. The term is used solely to 
contare ize through analogy the information 
Deura nes in the neural pattern (that is, the 
hance diacitations resulting from the lumi- 

Istribution) prior to organization. 


Neur; 
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cues. The experiments in Study I, for 
example, suggest that there exists a 
presumption to see a surface as being a 
single lightness if the conditions are 
consistent with the variations of lumin- 
ance being seen as shadows and high- 
lights. A convenient working hypothesis 
is probably to assume that the organiza- 
tion of a lightness pattern is such as to 
maximize the congruences between the 
different cues and to provide an econom- 
ical summary of the stimulus pattern. 
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The effect of very low drive states on the probabilities of a dominant 


and a competing response were investigated. 
was established through training procedures with do 
peting responses having different reinforcement probabilities. 


40 male, college students. 


The response hierarchy 
minant and com- 
Ss were 


The results were consistent with the hy- 


pothesis, based on certain constructs from Hull-Spence theory, that 
competing responses are facilitated by reduction of drive within the 


range of low drives. 


A generally recognized assumption, 
particularly in the areas of counseling 
and psychotherapy, is that, when 
behavior patterns have been well 
learned, behavior change is facilitated 
by establishing a low, or subnormal, 
drive state (relaxed, low-anxious) in 
the individual. This is implemented 
by techniques such as the use of the 
couch, the creation of good rapport, 
making the person comfortable, and 
the like. Wolpe (1958) has been one 
of the most explicit users of this as- 
sumption. In the interest of modify- 
ing strong dominant anxiety responses, 
he has used specific relaxation pro- 
cedures. One way of looking at 
procedures like the above where re- 
laxation has been used is to view 
relaxation as a response which com- 
petes with the dominant response, 
anxiety. Another more general way 
of seeing the role of relaxation in be- 
havior change is to view relaxation, 
not necessarily as a response, but as a 
lowering of a general drive state 
through which the relative superiority 
of any dominant response would be 


1The authors wish to thank Fred Shima 
for running Ss and Steve Young for setting 
up the apparatus. This research was sup- 
ported by funds from the University Research 
Committee. 


It was concluded th 
views on the important role of low drive i 


at the results support existing 
n the modification of behavior. 


reduced and any competing response 
would be facilitated. If “subnormal” 
drive in this latter sense is effective 
in increasing the probabilities of non- 
dominant responses, the use of relaxed 
states would have general importance 
for many areas where behavior change 
is desired. Examples are the areas of 
response originality (Maltzman, 1960) 
and creativity (M ednick, 1962), where 
theoretical discussions have empha- 
sized the availability and use of non- 
dominant responses. 

The purpose of this experiment was 
to investigate the effect of lower than 
"normal" drive (very relaxed) states 
on the probabilities of a dominant (.8 
reinforcement probability) and a com- 
peting (.5 reinforcement probability) 
response. The Ss were initially 
trained in this two-response situation 
under conditions usually designated 
as low drive. It was hypothesized 
that further reduction in drive below 
an already low drive state would 
produce an increment in the response 
Which has the lower probability of 
reinforcement, with a corresponding 
decrement in the response which has 
the higher probability of reinforce- 
ment. This hypothesis was contin- 
gent on the assumption that the 
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response strength of the competing 
response would remain above thresh- 

old even in the low drive condition. 
The hypothesis is consistent with 
certain constructs from Hull-Spence 
theory (Spence, 1956). According to 
the multiplicative relation between 
drive (D) and habit strength (H), the 
evocation probabilities of the com- 
Peting response should increase with 
reduction in D. This follows from the 
assumption that the probability of a 
response is a function of the difference 
in the response strengths between the 
dominant response (Za) and the com- 
Peting response (E,). The magnitude 
of the response strength superiority 
of the dominant response over the 
competing response is seen as a direct, 
multiplicative function of D, Ea — E. 
= ÍÍ[DQür, — H,)]. Thus, a decrease 
m D should be accompanied by a 
€crement in the probability of the 
dominant response and a relative in- 
Crease in the probability of the com- 
dendi response. If these hypothe- 
B F changes in response probabilities 
hey ound in the present experiment 
ae or be considered to be a 
“ee lon of the lower drive since rein- 
Sanie Do conditions remained the 
oth during training under low 


dri $ 3 
nive and testing under still lower 
drive, 


METHOD 


psy cies — he Ss were 40 introductory 
Assigned t. class male students. Ten Ss were 
ese- = t0 each of the four treatment groups 
Cribed later, 
Pbaratus.—The S and the control equip- 
rend in adjacent rooms separated by a 
tained”, Vision window. The S’s room con- 
Shapeq an interviewing couch with wedge 
Speake, Cedrest, a finger response unit, a 
er, and a microphone. 
b small nee response unit was anchored on 
Compri. d beside the couch. The unit 
Surface € a block of wood with a 23 X 54 in. 
e c Covered by a stainless steel plate. In 
Rin, citral region of the surface were two 
ameter holes spaced 1} in. apart on 


me 
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center. Each hole contained a finger contact 
pin that projected up to the level of the steel 
plate. An adjustable hand-forearm rest that 
extended from the side of the block could be 
adjusted for the angle of S's arm as it rested 
in relaxed position along the couch, and for 
the length of forefinger. In each trial in- 
terval, simultaneous finger contact with a pin 
and the steel plate activated a transistor relay 
circuit which operated impulse counters. 

Starting with forefinger on one pin, S was 
required to make one of two responses in 
each trial, either move his finger to the other 
hole (Move response) or not move his finger 
at all (Stay response). Total trials and 
frequencies of each type of response were 
recorded on counters. 

Procedure,.—The S’s room was dimly 
lighted by a 25-w. bulb. Illumination was 
just sufficient for E to observe and tabulate 
gross motor movements which provided one 
check on S's state of relaxation. 

The S was asked to lie on the couch and 
instructed to learn to make the correct re- 
sponse following each signal to respond (an 
approximately ys-sec. duration, high-fre- 
quency tone). This would be followed by 
another signal (1-sec. duration, low-frequency 
tone) if his response was correct and no tone 
if incorrect. (However, in fact, one response 
was randomly reinforced .8 of the time and 
the other was randomly reinforced .5 of the 
time.) The interval from the respond signal 
to the point where reinforcement might be 
given was 3 sec., and from that point to the 
next respond signal was 2 sec. 

The four groups of Ss are distinguished by 
the response (Move or Stay) which had the 
low probability of reinforcement and was thus 
established as the competing response, and 
by the treatment given between the training 
and the test sessions. For two of the groups 
Stay was the competing response and Move 
was the dominant response, and conversely for 
the other two groups. Because the focus of 
the experiment was on changes in probability 
of occurrence of responses which are hier- 
archically ordered, a learning criterion was 
used to select Ss with the .8 reinforced re- 
sponse clearly dominant by the end of the 
training trials. Only those Ss were used who 
attained a criterion of 70% or greater domi- 
nant responses (30% or less competing 
responses) over the last 25 training trials. 
Seventy of 110 initial Ss did not meet the 
training criterion. 

The training procedures and response 
criterion that resulted in the rejection of the 
large number of Ss perhaps requires comment. 
The .5 and .8 reinforcement probabilities were 
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chosen with the objective of establishing a 
stable dominant response, yet not so superior 
in habit strength that the probability of an 
increment in the competing response would be 
negligible under the amount of drive reduction 
used in the experiment. Using these rein- 
forcement probabilities, the criterion of 70% 
dominant responses over the last 25 of 100 
training trials was too severe. Of the first 60 
Ss only 15 reached criterion. When the 
procedure was changed to give 150 training 
trials 25 of 50 Ss reached criterion. The 
proportions of Ss reaching criterion by each 
of the two procedures were nearly identical 
in the experimental and control groups. 

Within each pair of groups that had the 
same competing response, one group received 
a drive reduction (relaxation) treatment be- 
tween training and test sessions. The other 
group (control) was given a nonrelaxation 
experience. Thus the four groups are 
designated as: Relax-Move (i.e, relaxation 
treatment with Move being the competing 
response), Relax-Stay, Nonrelax-Move, and 
Nonrelax-Stay. 

The drive-reduction procedure consisted 
of telling S over the speaker to follow in- 
structions for four relaxation exercises recom- 
mended by Jacobson. Slightly modified from 
the original (Jacobson, 1938), these instruc- 
tions were spoken slowly and with pauses to 
pace S’s movements: 


1. Raise both arms straight up. Clench 
your fists. Hold very tightly. Now 
slowly let your arms fall to your sides and 
go completely limp. 

2. Now push with your heels straight out 
toward the wall, hard, and as far as you 
can. Hold this position tightly. 
let your legs relax against the cot. 

3. Now take a deep breath—and when 
you exhale let your chest drop completely. 

4, Next wrinkle up your forehead as 
tight as you can, raise your eye brows as 
high as possible. Then gradually let go 
and relax your face completely. Relax 
your forehead and your eyes. Relax your 
mouth and jaws—completely relax. 


Slowly 


The control groups were asked to solve a 
problem mentally; to take the number 10,000, 
divide by 4 until the smallest possible quotient 
was obtained, and when an answer was derived 
to recheck it. Following the respective in- 
structions all Ss were told that the task in the 
second session was to be identical to that in 
the first. After the test trials, each S rated 
iis degree of relaxation in each of the preced- 
ng two sessions on a 7-point scale ranging 
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from Not relaxed to Very relaxed. A selí- 
rating anxiety inventory (Nakamura, 1960) 
was administered at the end of the experiment. 

The main stages in the procedure may be 
summarized: 100 or 150 training trials, 4-min. 
interval during which the two experimental 
groups received relaxation exercises and the 
two control groups mentally solved the 
arithmetic problem, 100 test trials, self- 
ratings on relaxation scale, and anxiety in- 
ventory. The reinforcement conditions re- 


mained the same over both training and test 
trials, 


RESULTS 


The mean percent competing re- 
sponses in the last 25 trials of training 
are shown in the left side of Fig. 1. 
The analysis of variance applied to 
the mean scores indicated that differ- 
ences between the Relaxation Treat- 
ment and Response Type groups, and 
the Treatment x Response Type 
interaction were all nonsignificant. 
Thus the four groups had attained 
essentially the same response strength 
of the competing response at the final 
Stage of training. A further check 
of the comparability of groups on 
training response strengths was made 
by comparing the Relax and the Non- 
relax groups on percent competing 
responses in the 25 trials preceding 
the 25 criterion trials. This difference 
was also clearly nonsignificant, 

The mean percentages of the com- 
peting response in blocks of 25 test 
trials are presented by the connected 
points in Fig. 1, Percent competing 
responses for the two Nonrelax groups 
remained about the same level over 
blocks of trials, In contrast, the 


curves for the two Relax groups show 
Increments over blocks, 


An analysis of variance using the 


difference scores between the mean of 
the last 25 training trials and the 
mean of the 100 test trials yielded a 
significant Treatment effect, F (1, 36) 
= 8.21, f. « .01. This confirmed the 
apparent differences shown by the 
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curves in the amount of increase in 
percentage of competing responses 
between the Relaxed and Nonrelaxed 
Treatment groups. The Response 
Type effect and the Treatment X Re- 
sponse Type interaction effect were 
Not significant. 

A similar analysis of variance ap- 
plied to the difference scores between 
the last 25 training trials and the first 
25 test trials yielded a significant 
Treatment effect, F (1,36) = 4.40, 
P < .05. An analysis of the difference 
between the last 25 training trials and 
Test Trials 76-100 was also significant, 
F (1, 36) = 12.49, p < .01. Together, 
these results indicated that there was 
a slight but significant immediate 
effect of the Treatment and that this 
effect was maintained to the end of 
testing. The effect of the Treatment 
appeared to increase over trials. 

Owever, a trend test of the linear 
Component yielded nonsignificant re- 
sults. The Treatment effect did 
2pproach significance, F (1,36) = 3.81, 
P< 10: 

"ns the secondary measures ob- 
Mes in the experiment, only the 
tio Jective ratings by Ss on the relaxa- 
R n scales approached significance, 
A 36) = 3.72, p < .10. The Ss in 
hos Relaxation Treatment groups 
ed to rate themselves more re- 
axed in the test-trials session than in 
beg training session while the non- 
d prion groups showed no measur- 
€ changes, Parenthetically, it may 

one mentioned that the measurable 
cant of change was limited by a 
"ling. The relaxation ratings of the 
in Session were already very high 
h an overall mean of 48 on a 
“Point scale. Analyses of scores for 
rati remaining two measures, E's 
iet, © Of S’s movements, and the anx- 

Y inventory, were nonsignificant. 
clud; Ss used in the experiment, in- 

Ing those that were rejected, were 
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BLOCKS OF 25 TEST TRIALS 


Fic, 1. Mean percent competing respon 
in the last 25 training trials, and over fc 
blocks of test trials for four groups of 10 S 


randomly assigned to the Relax ai 
Nonrelax treatments and given te 
trials as well as training trials sin 
there was not time to score f 
criterion performance level imme 
ately following the training sessio 
Thus, test data were available for t 
70 Ss who were rejected because th: 
did not meet the training criterion 

70% dominant responses. Inspectic 
of these data showed that the mec 
response levels of the .8 reinforcemei 
probability response in the last 4 
training trials were 48% and 499 
respectively, for Ss assigned to tl 
Relax and Nonrelax treatments [i 
Ss in each group). The means of tl 
.8 reinforcement response over tl 
total test trials were an identical 53° 
for both groups; the response leve 
plotted over four 25-trial blocks bein 
generally flat with a very slight i1 
crease over blocks. Thus, the overa 
performances of the rejected Ss a 
signed to the two treatments we 
quite similar. The data sugges 
furthermore, that Ss in whom 

dominant response is not establishe 
in 100 or 150 training trials with tl 
procedures used in this experiment a1 
not likely to attain a clearly dominar 
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response in an additional 100 trials 
of a second session. 


DISCUSSION 


The results were consistent with the 
hypothesis that competing responses are 
facilitated by reduction of drive at 
relatively low drive levels. In the re- 
sponse with the lower probability of rein- 
forcement, which was established as the 
competing response, the relaxation pro- 
cedure produced an increment in that 
response as compared to the nonrelaxed 
condition. The results are also consistent 
with the Hull-Spence hypothesis of de- 
crease in drive leading to decrease in 
Ea — E, in conditions where response 
strength of the competing responses is 
above threshold. Where drive levels are 
lower than those used in this experiment, 
or where the strength of the competing 
response is less, it is possible that a 
reduction in drive would cause competing 
response strengths to go below threshold 
and the facilitation of competing re- 
sponses found in this study may not be 
observed. 

There are, of course, alternative ex- 
planations that might be put forth to 
account for the obtained effects. These 
are inadequate attention to the task by 
the relaxed Ss, and different drive stimuli 
(Sp) associated with the different drive 
levels at testing. 

Very possibly the relaxation proce- 
dures could reduce S's attentiveness to 
the task. However, if attention were 
inadequate owing to sources such as 
reduced concern about performance or 
S's bordering on falling asleep, this would 
affect most the condition where the 
dominant response was the Stay response 
than when it was the Move response. 
There was no significant difference found 
between these conditions as indicated by 
the nonsignificant Response Type effect. 
Moreover, the presence of at least a 
minimally adequate attention level was 
indicated by the fact that trials in which 
a response could not be distinctly scored 
as a Move or a Stay response were a 
rarity and those that did occur were 
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almost equally scattered over the four 
groups. 

The criticism that the experiment did 
not control for possible differences in Sp 
at the different drive levels does not 
appear to be particularly critical for 
interpretation of the results since the 
conditions of reinforcement remained the 
same for each S throughout both the 
training and test sessions. With this 
procedure, responses early in the test 
trials would be affected most by changes 
in Sp, and the effect would be expected 
to diminish as relearning under the new 
Sp occurred with continued trials. This 
is because the reinforcement probabilities 
would become effective with the con- 
tinued trials and would be expected to 
produce a recovery and possibly an in- 
crease in the strength of the dominant 
response, The present results, on the 
contrary, show a tendency for a con- 
tinuing decrement in strength of the 
dominant response. This explanation 
would pertain whether Sp changes were 
attributed to changes in drive level 
through relaxation exercises or through 
the possibility that Ss interpreted the 
task in the test session to be different 
in some way in spite of having been told 
by E that it was to be “identical.” 

The further question of whether 
verbal instruction alone would have been 
as effective as the complete relaxation 
procedure used is indicated by evidence 
that verbal instruction or information 
can reduce the strength of a conditioned 
response as a function of reduced drive 
(Lindley & Moyer, 1961 ; Wickens, Allen, 
& Hill, 1963). However, these studies 
involved considerably higher drive levels 
than in the present experiment. It seems 
reasonable to expect that in this experi- 
ment the effects of verbal instruction and 
actual physical relaxation combined to 
Produce low drive, A tentative indica- 
ton that Ss given the relaxation treat- 
ment felt more relaxed in the second than 
in the first session, and that there was no 
measurable change in the control groups 
was given by the data from the sub- 
Jective ratings of relaxation, The avail- 
able data are not adequate to satis- 
factorily answer the question of the 
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relative effectiveness of verbal and 
physical factors. This still remains to 
be investigated. 

Several studies have reported that 
reduction in drive produces a decrement 
in a response (Grings & Lockhart, 1963; 
Lindley & Moyer, 1961; Wickens et al 
1963). Others have demonstrated that 
relaxed conditions yield a greater vari- 
ability of responses to stimuli (Horton, 
Marlowe, & Crowne, 1963; Siipola, 
Walker, & Kolb, 1955; Usdansky & 
Chapman, 1960). However, most of this 
Work was not concerned with establishing 
an especially low level of drive. Further- 
More, little or no emphasis has been 
given to the specification and control of 
the competing response, which is of 
central importance if the primary con- 
cern is the facilitation of such responses. 
The present study differed from previous 
research in its attempt to build in specific 
dominant and competing responses of 
given strengths and to investigate the 
Probabilities of these responses as a 
function of reduction of drive within the 
range of low drives. This experimental 
eimi gives a sound base for the con- 
s Sion that lower than normal drive 

ates facilitate competing responses. 
ex results thus provide support for the 
j^ Sting views on the important role of 

W drive in the modification of behavior. 
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UCS PROPERTIES IN CLASSICAL CONDITIONING 


OF THE ALBINO RABBIT'S NICTITATING 
MEMBRANE RESPONSE? 


ALFRED BRUNER? 
University of California School of Medicine, Los Angeles 


The effects of light onset and corneal puff as UCSs were investigated. 
In Experiment I a group receiving 100% light UCS trials failed to con- 
dition, contrary to contiguity theory expectations, since UCRs were 
reliably elicited, while a 100% puff group showed clear conditioning. 
Another group, receiving 50% puff and 50% light UCSs (50/50-PL), 
was superior to a conventional 50% puff group (50-P), the light ap- 
parently mediating puff effectiveness. In Experiment II a group 
receiving 50% puff and 50% vibrotactile UCSs performed midway 
between Groups 50/50-PL and 50-P asymptotically. Light onset was 
not reinforcing, but instead depressed responding below the normal 


extinction level in a group switched from 100% puff to 100% light UCS 


trials. 
between-session recovery. 


Among classical conditioning (CC) 
studies relevant to the contiguity- 
reinforcement issue are those which 
have attempted to condition the 
pupillary reflex using as the UCS 
either a change in illumination or 
shock (Kimble, 1961, p. 52). These 
studies have shown that reliable 
pupillary CRs (dilation or constric- 
tion) are not observed when the UCS 
is a change in illumination, while 
reliable CRs (dilation) are observed 
when the UCS is shock. One such 
study suggested that the “reinforce- 
ment characteristic" of shock facili- 
tates conditioning, while light, “unless 
the intensity is very high, is not 
considered to have intrinsic reinforce- 


1 Based on a thesis submitted in partial 
fulfillment of the requirements for the PhD 
degree, Indiana University, 1963. The re- 
search was performed during my tenure as a 
United States Public Health Service pre- 
doctoral fellow and supported in part by 
National Science Foundation Grant GB-145 
to I. Gormezano, to whom I wish to express 
my gratitude for his advice and assistance 
throughout the course of this research. 

?Now at the University of Southern 
California. 


All 50% groups exhibited within-session decrements and 


ment properties [Gerall, Sampson, & 
Boslov, 1957, p. 468]." Since both 
shock and light offset elicit dilation 
UCRs, the use of either as the UCS in 
the presence of a neutral stimulus 
satisfies the essential conditions of the 
traditional notions of stimulus-re- 
sponse contiguity. Consequently the 
studies of pupillary conditioning are 
taken to support a reinforcement 
interpretation of CC. 

The present studies parallel those 
on pupillary conditioning in that light 
was employed as a UCS and its effects 
compared to a known reinforcing 
stimulus (corneal puff). In the pres- 
ent instance the response under study 
was nictitating membrane extension 
in the albino rabbit, whose condition- 
ing with a corneal puff UCS has been 
reported previously (e.g., Gormezano, 


Schneiderman, Deaux, & Fuentes, 
1962). 


ExPERIMENT I 
Method. 


Subjecis.—The Ss were 32 naive, male and 
female albino rabbits, 85 to 100 days old, 
procured from Windsor Biology Gardens, 
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Bloomington, Indiana. The animals were 
housed in individual living cages with free 
access to food and water. 

Apparatus.—The apparatus, potentiom- 
eter-polygraph recording system, and method 
of restraint have been described in detail 
previously (c.g., Gormezano, 1965). Four Ss 
were run simultaneously, each being re- 
strained in a Plexiglas holder within a venti- 
lated refrigerator shell. Continuous indirect 
illumination was provided by a 6-w., 120-v. 
bulb located 1 ft. above S, adjacent to which 
was a 200-w., 120-v. incandescent lamp which, 
When illuminated, served as à UCS for some 
groups. 

'The CS was a 3,500-cps tone (72 db. SPL, 
-0002 ref.) presented through a speaker 5 in. 
in front of S. The puff UCS was compressed 
nitrogen under sufficient pressure to support 
an 80-mm. column of mercury as measured at 
theeye. Electronic timers were set to provide 
a CS duration of 900 msec. and a UCS 
duration of 400 msec. Onset of the UCS 
Occurred 500 msec. after CS onset and the 
two stimuli terminated together. These 
timing specifications are accurate for trials 
on which the puff UCS was used. On light 
UCS trials, however, the CS-UCS interval 
was about 600 msec., rather than 500 as on 
Ru trials, due to the relatively slow rise time 

the 200-w. lamp, which was judged to 
aoe an effective UCS about 100 msec. 
ie. application of current. The intertrial 
E rvals were 15, 25, and 35 sec. in a random 
ined! with a mean of 25 sec. A CR was de- 
Wer I. an upward deflection of the response 

in mm. or greater from the record base 

€ between 50 and 500 msec. after CS onset. 
gioi ante The Ss were divided into four 
Mas Ps of eight animals each. Group 100-P 
cons a conventional CC group receiving a 
Send puff as the UCS on all acquisition 
the ui Group 100-L received light onset as 
x CS on all acquisition trials; Group 50-P, 
grog ventional partial-reinforcement (PR) 
tio P, received a puff on 50% of the acquisi- 

n trials and no UCS on the other 50%; 

poup 50/50-PL received a puff UCS on 50% 
on the acquisition trials and the light UCS 

i te other 50%.3 The randomized sequence 
aa on which puffs were given was the 

T for Groups 50-P and 50/50-PL. 

Ollowing two adaptation sessions, 


3 In 
anim 


during 


a pilot study a grouP of eight 


nals treated almost identically to Group 

50/50 pa failed to develop CRs- Group 
Potenti L was therefore devised to provide a 
i hus Sly more sensitive measure of the 
ERPs effects, 
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which spontaneous responses were recorded 
all Ss received 70 acquisition trials per day 
for 10 consecutive days. On the day following 
completion of acquisition 200 extinction trials 
were given. On the next day 200 additional 
nonreinforced trials were given and this day 
will be designated spontaneous recovery. 


Results 


Since both the light and puff UCRs 
involved nictitating membrane ex- 
tension (closure), their recorded to- 
pographies appeared quite similar, 
with the light UCR usually having a 
somewhat longer latency (100 msec. 
vs. 50 msec.) and slower rise time than 
the puff UCR. Neither UCR evi- 
denced an opening (orienting) com- 
ponent, nor did the UCRs differ 
noticeably between groups or over 
trials. 

All statistical analyses were per- 
formed on the arcsin transform of 
percentage CRs over 10-trial blocks. 
By convention, however, the figures 
are presented in terms of nontrans- 
formed percentage CRs. Figure 1 
presents each group’s mean percentage 
CRs plotted over 70-trial blocks dur- 
acquisition and 40-trial blocks 
during extinction and spontaneous 
recovery. During adaptation, re- 
sponding did not exceed the spontane- 
ous blink rate typical for rabbits (two 
or three per hour) and was therefore 
too low to be plotted clearly in Fig. 1. 

Figure 1 indicates that Group 100-L 
does not appear to demonstrate any 
evidence of conditioning, although 
UCRs were reliably elicited by the 
light throughout acquisition. Group 
100-L was consequently excluded from 
the data analyses to avoid error under- 
estimation. An analysis of variance 
over all acquisition revealed significant 
differences between groups, F (2, 21) 
— 5.21 (the .05 significance level is 
utilized throughout the report), and 
individual ż-test comparisons indicated 
significant differences between Groups 


ing 
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100-P and 50/50-PL (t = 2.17) and 
Groups 100-P and 50-P (é = 3.21; 
two-tailed ¢ critical = 2.03 for both 
comparisons). The chance likelihood 
of the difference between Groups 
50/50-PL and 50-P over the last two 
acquisition sessions was p = .06 
(Fisher median test, Siegel, 1956). 
On the right side of Fig. 1 are the 
performance curves for extinction and 
spontaneous recovery, where, as is 
often the case in extinction data, there 
was considerable individual vari- 
ability. Thus although Group 50/ 
50-PL’s response strength appears 
superior during both these sessions, 
1 tests revealed that the only signifi- 
cant difference occurred during spon- 
taneous recovery and was between 
Groups 50/50-PL and 50-P, ¢ (14) 
= 1.81. No adjustments were made 
for differences between acquisition 
asymptotes in these comparisons. 
As in some previous studies (e.g., 
Grant & Schipper, 1952), the PR 
group (50-P) shows a slight increase 
in responding from the end of acquisi- 
tion to the first data point of extinc- 
tion, while the 100% group (100-P) 
shows a rapid, marked decrease. 
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Thereafter these two groups perform 
similarly. Although the extinction 
data are collapsed into 40-trial blocks 
the early extinction findings are not 
obscured, since the initial points 
shown also accurately reflect the 
approximate percentages observed 
when plotted over 5-trial blocks. 

Figure 2 displays the within-session 
(10-trial blocks) acquisition perform- 
ances of Groups 100-P, 50-P, and 
50/50-PL. As is apparent the curves 
of 100-P tend to be U shaped over the 
last four acquisition sessions, while 
those of the PR groups, especially 
50-P, tend to show a progressive re- 
sponse decrement as the session pro- 
ceeds, following an initial high level of 
responding relative to the end of the 
preceding day. The analysis of vari- 
ance over all acquisition revealed a 
significant effect for 10-trial blocks, 
F (6, 126) — 2.59, indicating that the 
within-session performance changes 
shown were reliable. 

Analyses of variance on the final 
two acquisition Sessions, extinction, 
and Spontaneous recovery revealed no 
significant differences between groups 
in mean response latency. The reader 
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Fic. 2. Group mean 


percentage CRs over 10-trial blocks showing a separate 


curve for each of the 10 days of acquisition. 


is referred to Bruner (1963) for 
further details of the CR and UCR 


latency analyses. 


Discussion 


Group 100-L’s failure to condition 
Corresponds to the finding of Gerall et al. 
Q957) mentioned earlier, in which con- 
ene between CS, light UCS, and UCR 
ate not sufficient to bring about detect- 

e CRs, In contrast when the present 
wes Was a corneal puff (or when Gerall's 
Both was shock) CRs readily appeared. 
ni the present UCSs (light and puff) 
ac re clearly defined and reliable in 

currence. Had a more intense light 
Gan used CRs might have developed in 
nue 100-L, for when UCS intensity is 
in so is strength of conditioning 
E8» Ross & Hunter, 1950). But it is 
ee that increasing the lights in- 
$ nsity may increase its aversiveness as 
vell as its perceptibility- Accordingly, 

atson's (1916) early claim of "some" 
papillary conditioning seems plausible in 
a of his describing the UCS as a “very 
rong light." Unfortunately there ap- 
peat to be no reported attempts to 
©plicate Watson's study, which by 
Present day standards was too brief and 
asual to constitute acceptable evidence. 
tw he within-session decrements and be- 
een-session recoveries exhibited by the 


PR groups suggest an inhibitory effect 
of nonreinforcement which cumulated 
within sessions and dissipated between 
them. The cumulative decremental ef- 
fects of PR have been described previ- 
ously (e.g., Ross, 1959), but since seldom 
is more than one session employed there 
are very few previous indications of the 
dissipation effect (Grant, Riopelle, & 
Hake, 1950; Grant & Schipper, 1952; 
Humphreys, 1939). The latter three 
studies did employ a second session and 
did observe a rise in PR performance at 
the outset of the second session, sug- 
gesting a dissipation effect which was not 
demonstrated by 100% groups. It would 
therefore seem desirable to employ 
multiple sessions in future studies of PR 
inhibition. 

While the light did not bring about 
conditioning when presented as the sole 
UCS in Group 100-L, it appeared to 
enhance responding when interspersed 
among puff trials in Group 50/50-PL. 
The latter outcome could have been the 
result of the interspersing procedure, per 
se, unique properties of the light, or both. 


EXPERIMENT II 


This experiment sought to extend 
the findings of Exp. I by testing (a) 
whether a stimulus which does not 
elicit membrane extension would facil- 
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Group mean percentage CRs for 70-trial blocks in acquisition and 40-trial blocks 


in extinction and spontaneous recovery for Exp. II. 


itate conditioning when interspersed 
among puff trials as the light tended 
to, and (b) whether the light would 
maintain or enhance responding when 
it followed a series of puff trials as it 
had when interspersed among them. 


Method 


'The method for Exp. II is identical to that 
for Exp. I except for the points noted below. 
Subjects.—The Ss were 20 albino rabbits. 
Apparatus.—The only additional piece of 
apparatus used in Exp. II was a vibrotactile 
stimulator, 23 mm. diameter, 33 mm. high, 
47.5 gm., constructed according to instruc- 
tions by Bice (1961). Originally a small 
earphone assembly, the vibrator was firmly 
snap-fastened to the velcro strap passing over 
the animal's head allowing its 60-cps vibra- 
tions to be mechanically transmitted to the 
top center of the rabbit's head. Pilot work in 
which the vibrator was used as a CS indicated 
that it served as a clearly perceptible stimulus 
for the rabbit. o. 
Procedure. —The animals were divided into 
two groups of eight each and one group of 
four. Group 100-P/L (eight Ss) received 10 
daily 70-trial sessions of 100% puff reinforce- 
ment during acquisition. The 200 extinction 
and 200 spontaneous recovery trials were 
modified for Group 100-P/L in that instead 
of presenting the CS alone, the light was in- 
troduced as a new UCS and the puff witheld. 
Group 50/50-PV (eight Ss) received the puff 
on half the acquisition trials and the vibrator 


on the other half in the same sequence used 
for 50/50-PL earlier, and received conven- 
tional extinction (i.e. CS alone). Group 100-V 
(four Ss) received the vibrotactile stimulus in 
place of a UCS on all acquisition trials fol- 
lowed by conventional extinction. 


Results and Discussion 


Figure 3 presents mean percentage 
CRs for each group plotted over 70- 
trial blocks in acquisition and 40-trial 
blocks in extinction and spontaneous 
recovery. During acquisition Group 
100-P/L performed much like the 
previous 100% group both within (not 
shown) and over sessions (Fig. 3). 
Group 50/50-PV attained an asymp- 
tote at about the 50% level of re- 
sponding, midway between the asymp- 
totes of Group 50/50-PL (64%) and 
50-P (40%) of Exp. I (cf. Fig. 1 and 
3), and also manifested the within- 
session PR decrement. Group 100-V 
showed no indication of conditioning 
although one atypical rabbit was re- 
sponsible for a few elevated points 
shown in Fig. 3. An analysis of vari- 
ance over all acquisition for Groups 
100-P/L and 50/50-PV revealed a 
significant F (1, 14) = 7.27 for groups. 

On comparing the three PR groups 
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of both experiments, Group 50/50- 
PV's intermediate asymptote suggests 
that the interspersing procedure in 
itself facilitated conditioning some- 
what, while the light as the inter- 
spersed UCS enhanced it even further. 
These effects are not clear statistically, 
however, as individual ¢-test compari- 
sons failed to indicate (.30 > P > 25) 
that Group 50/50-PV was significantly 
different from either of the earlier PR 
groups over the last 2 days of acquisi- 
tion. During extinction and spon- 
taneous recovery, however, Group 
50/50-PV's resistance to extinction 
was about the same as 50-P's (cf. 
Fig. 1 and 3) and significantly less 
than 50/50-PL's, # (14) = 2.09 for 
extinction, and 1.97 for spontaneous 
Tecovery, implying that unique prop- 
erties of the light were responsible for 
the boosted response strength of 
Group 50/50-PL. 

As contrasted to Group 100-P 
(Fig. 1), Fig. 3 shows Group 100-P/L's 
Pi aae to be greatly depressed 
owing the first 40 trials of “extinc- 
ge where the light replaced the puff 
fo the UCS. The difference in per- 
por iem n Groups 100-P and 100- 
ext during the last 160 trials of 

‘nection was compared by the 
i eg median test (Siegel, 1956), 
P Mori revealed a p = .06. On the 
a al day in Fig. 3 Group 100-P/L 
ain shows a reduced CR frequency 
allowing some spontaneous recovery 

the initial data point. 
e aradoxically there appear to be 
= © conflicting findings with regard to 
i € effects of the light UC s. When 
Nterspersed among puff trials the 
a tended to enhance response 
Uae But when presented as the 
in following puff conditioning the 
T6 t markedly reduced CR frequency 
Mies to regular extinction. | o 
& Ore closely examine the individual 

ects of the light, puff, and vibrator, 
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TABLE 1 
Tueta (0) VALUES REFLECTING EXTENT AND 
DIRECTION OF CHANGES IN RESPONSE 
PROBABILITY FOLLOWING PUFF 
AND NONPUFF TRIALS 


ð on Trial x +1 
UCS Condition 


on Trial » 

50/50-PL | 50/50-PV 50-P 
Puff .236 .220 .085 
No puff —"123 | —208 | —.124 


Note.— The minus sign indicates a negative response 
tendency. 
a sequential analysis was performed 
on the data of the PR groups' last 
two acquisition sessions. This en- 
tailed frequency counts of CRs on 
each trial depending on the occurrence 
or nonoccurrence of a puff on the pre- 
ceding trial. The difference in fre- 
quency of shifting from one response 
state (CR or no CR) to another, over 
pairs of trials, was corrected for 
overall response probability and util- 
ized as an index (0) of the average 
change in response probability over 
trials following administration of the 
particular UCS being examined.* 

The 6s reflecting changes in re- 
sponse probability on Trial z + lasa 
function of whether or not a puff was 
given on Trial n are presented in Table 
1 for the PR groups for the last two 
ns. As the data of 


acquisition sessio } 
Table 1 suggest, the sequential anal- 


ysis provided two main findings: (@) 
Following a puff on Trial 2, response 
probability was boosted in all three 
PR groups on Trial n + 1, but was 
clearly higher in the two 50/50 groups 


whose values were similar. (b) Fol- 
however, the 


lowing nonpuff trials, 
three PR groups manifested a tend- 


4 The resultant ratio, 6, is commonly em- 
ployed in mathematical learning experiments 
as a learning rate parameter. The complete 
sequential analysis and corresponding for- 
mulae are presented in Bruner (1964). 
Thanks are due to James J. Greeno for his 
assistance on this analysis. 
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ency to not respond, which was 
greatest in Group 50/50-PV with 
Groups 50/50-PL and 50/P being 
about equal. Together these findings 
suggest that the immediate effect of 
the substitute UCS was to depress 
responding (Finding b), while at 
the same time serving indirectly to 
support responding by somehow main- 
taining or mediating the puff's effect- 
iveness at a level above that attribut- 
able to UCS omission (Finding a). 
Finding a indicates that puff effective- 
ness was maintained to an equal 
extent by both light and vibrator, 
implying that the properties of the 
substitute UCS did not constitute an 
important variable in producing the 
mediation effect. But the properties 
of the substitute UCS seem more im- 
portant in the extent to which it 
detracts from response strength: the 
different amounts of response decre- 
ment following light and vibrator 
trialsappears to be themajor difference 
observed between these two stimuli 
(Finding b). The similar negative 0s 
shown in Table 1 following light and 
omission trials (50/50-PL and 50-P, 
respectively) indicate that these two 
conditions produced equal amounts of 
decrement in the interspersing pro- 
cedure. However, the light's decre- 
mental effect on the depressed “extine- 
tion” responding of Group 100-P/L 
was greater than that attributable to 
puff omission. 

One hypothesis suggested by this 
discrepancy is that the interspersing 
procedure facilitated transfer (gen- 
eralization) of stimulus properties 
from the puff to the light which was 
sufficient to mitigate the light's decre- 
mental effect somewhat, but was not 
sufficient to boost the light's 6 up to 
a positive value. This transfer would 
not be expected to occur so readily in 
the interspersed vibrator group how- 
ever, because the vibrator had fewer 
properties in common with the puff 
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than the light had (e.g., both puff and 
light elicited membrane extension). 
Hence the decrement produced by the 
vibrator was not mitigated to the 
extent of that produced by the light. 
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MONOPTIC AND DICHOPTIC VISUAL MASKING 
BY PATTERNS AND FLASHES! 
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This study investigated masking of letters by a bright flash of light or 


by a pattern. 
a monoptic effect; m: 
dichoptic conditions; ( 


The results showed that (a) masking by flash is primarily 
asking by pattern occurs under monoptic and 
b) increasing the interstimulus interval decreases 


masking by pattern less than by monoptically presented flash; (c) 


repetitior 
Different processes ar 
masking stimuli. 


When two visual stimuli are pre- 
sented in rapid succession, perceptual 
interference may result. This effect 
has been studied most extensively 
With respect to “backward” masking, 
when interference with perception of 
à first stimulus is studied as a function 
Of the second stimulus in the pair 
(Raab, 1963). A number of mechan- 
sms for this phenomenon have been 
Proposed (Alpern, 1953; Boynton, 
ge Exner, 1868; Kolers, 1962; 

gler, 1910; Werner, 1935). 
Cages authors (Kolers & Rosner, 
Wer ; Schiller & Wiener, 1963; 
Bier at 1940) have reported that 
iaces a B a visual pattern can be 
Stim ri either monoptically (both 
tically. to the same eye) or dichop- 
eyes). (the two stimuli to separate 
alio 4 when the masking stimulus is 
S visual pattern. Masking also 

curs when a visual form is followed 
Y a stimulus having essentially no 
m characteristics, such as a bright 

of light (Lindsley, 1961). 
dete, € aim of this study was to 
fer E whether masking by pat- 
and masking by flash reflect 

1 
x d by Mental 
E Postdoctoral Fellowship FPD-11, 589 

rant a anal Institutes of Health Training 
TIGM-1064. The author Is in- 


debt : 
Suggest ans Lukas Teuber for his helpful 


Th; 
his research was supporte 


1 of trials decreases masking by pattern but not by flash. 
e involved when flashes and patterns are used as 


identical processes or whether differ- 
ent mechanisms are involved. That 
the two might differ is suggested by 
the observation that masking by flash 
seems to be primarily a monoptic 
effect (Lindsley, 1961 ; Sperling, 1964), 
while masking by pattern has been 
shown to take place dichoptically. 
Accordingly, three experiments were 
performed. In the first, the extent of 
monoptic and dichoptic masking was 
investigated as a function of inter- 
stimulus interval in order to see if 
this temporal variable affects masking 
by pattern and flash in the same or 
different manner. In the second ex- 
periment, an analogous comparison 
was made with respect to the role of 
practice. In the third experiment, 
the flash and the pattern were pre- 
sented together in various combina- 
tions in order to make inferences 
about the relationship between these 
two masking situations. 


EXPERIMENT I 


Since masking depends on the 
temporal relationship between two 
succeeding stimuli, the central vari- 
able in the investigation of this 
phenomenon has been the duration of 
the interval between the two stimuli, 
here referred to as the interstimulus 
interval (ISI). The extent of “back- 
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ward” masking is a complex function 
of the duration of the ISI. In some 
cases masking declines steadily with 
increasing ISI; in others it increases 
at first and then declines (Alpern, 
1953; Kolers, 1962). Furthermore, 
masking studies have reported a wide 
range of ISIs at which maximal 
interference is observed. The under- 
lying variables have not been clearly 
established, although some mechan- 
isms have been proposed (Kolers, 
1962). 

This experiment investigated mon- 
optic and dichoptic visual masking 
with a pattern or a flash as masking 
stimuli. The extent of masking was 
determined for different durations 
of ISI. 


Method 


Subjects—The Ss were four paid volun- 
teers with equal visual acuity for both eyes 
(20/20 for all Ss). These were highly trained 
Ss who had Participated in several other 
experiments on visual masking. Trained Ss 
were used in order to minimize Practice 
effects (Schiller & Wiener, 1963). 

Apparatus.—The apparatus was a five- 
field tachistoscopic viewing box similar to the 
one described in a previous study (Schiller 
& Wiener, 1963). The electronic programmer 
was built by Sky Instrument Company. The 
light sources were cold-cathode mercury-argon 
lamps coated with magnesium-tungstate 
phosphor. For a more detailed description 
of programmer and lamps, see Kolers (1962). 

Materials.—Test stimuli: The stimuli to 
be detected were letters of the alphabet (r, I, 
J, M, Q, and w were omitted). These letters 
were contact photographs (Matte finish) of 
No. 118, 24-pt. Future Medium "Lettraset" 
letters. Each letter appeared in the center 
ofa 5 X 5 in. white card which was mounted 
on a plastic (Bakelite) plaque used for in- 
sertion in the viewing box. 


EXT 
T 


a b 


Fic. 1. The stimuli. (a) Sample ofa test 
letter. (b) The pattern masking stimulus in 
Exp. I, II, and IIT, 


Masking stimuli: Two masking stimuli 
were used: (a) a pattern (see Fig. 1) the arca 
of which was 5095 white and 5095 black; 
(b) a bright flash of light measuring 3.5 foot- 
candles (ftc.) at the eyepiece. This light was 
obtained by increasing the luminance of the 
light sources in the proper field. This light 
was reflected from a blank white card mounted 
ona stimulus plaque. The usual illumination 
of each of the fields with blank cards as 
stimuli was .14 ftc. at the eyepiece. The 
reflectance of the white surfaces used was 
90%; the black surfaces had a reflectance 
of 9%, 

The Ss looking into the viewing box saw a 
white circular surface (visual angle 7°6'). 
All stimuli were Presented foveally, i.e., at the 
fixation point designated by a small red cross 
(visual angle 3’) in the center of the field. The 
visual angle of the letters was 30’; the pattern 
subtended an angle of 1°12’, and the flash an 
angle of 7°6’, A larger visual angle was used 
for the flash in order to minimize possible 
figural effects due to closeness of borders 
between the masked and masking stimuli. 

Procedure.—Recognition thresholds were 
obtained for the singly presented test stimuli 
(letters) when followed by (a) no masking 
(N), (b) a flash to the same eye (FM), (c) a 
flash to the opposite eye (FD), (d) a pattern 
to the same eye (PM), and (e) a pattern to the 
opposite eye (PD). Four different ISIs were 
employed: 3, 20, 40, and 60 msec. For each 
of four Ss, 16 threshold measurements were 
obtained for each of the ISI and masking 
conditions. The Ss participated in four test- 
ing sessions. Each session began by taking 
two threshold measurements for warm-up 
using numbers as test stimuli but not followed 
by masking stimuli, During the rest of each 
session, four successive threshold measure- 
ments were taken (using letters) for each of 
the conditions. The order of test stimuli and 
conditions was random. 

The masking stimuli, which always ap- 
Peared after the test stimuli, were exposed for 
100 msec. under all conditions. Recognition 
thresholds for the test stimuli were obtained 
by the method of ascending limits and were 


Onset of the maski 
ISI, the focus fiel 

The interval 
approximately 3 sec, 
saying “ready” before each exposure. All Ss 
were asked to relax briefly while conditions 


ere instructed to make 
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Monoptic presentation 


flash 
pattern 
no masking 


o——o 


40 oA 


thresholds 


econds 


Recogniti 


3 20 


Interstimulus interval in milliseconds 


Fie. 2. 


their best guesses. Each session lasted ap- 
Proximately 1 hr. 

re n: eye not receiving the masking stimulus 

ilia esed to a blank field under the same 

the illt ation as that of the focus field. Thus 

sate to the eyes remained constant 
Bor or the appearance of a letter and/or 

pattern, 


Results 


Sante recognition threshold was the 
stimul exposure duration of the test 
ed us at which Ss correctly identi- 
amie ¢ letter. Since .Ss showed the 
àre rends, only the combined data 
Presented here. 
the = mean recognition thresholds for 
ne ations of this study are pre- 
diona Fig. 2. The SDs (not 
FM n) averaged 1-3 msec. for the N, 
$ed dm FD conditions, and 2-6 
- for the PM and PD conditions. 
€ results show that masking with 
Pattern occurred both mon- 


eus 
ten ally and dichoptically. The ex- 
dich, of this masking was slightly less 


mea, tically, This difference for the 
Was thresholds, although consistent, 
t dese; significant for the four Ss on 
WO, = 1.76, 1.41, 1.60, 1.31). 
ri th the flash, masking occurred 
arily on monoptic presentation. 


sli 
ight masking effect was also found 


Dichoptic presentation 


40 


Interstimulus interval in milliseconds 


Mean recognition thresholds for the conditions of Exp. I. 


dichoptically with an ISI of 3 msec. 
The differences between DF and N 
conditions at an ISI of 3 msec. were 
significant for three of the four Ss 
(t = 5.17, 3.30, 2.08, 1.53). 

The extent of monoptic masking by 
flash or pattern (FM or PM) de- 
creased with increasing ISI. The rate 
of decrease, however, was greater for 
Cond. FM than PM. An analysis 
of variance comparing FM and PM 
at the four ISI durations showed 
significant interaction between ISI 
and the masking stimulus employed 
(F = 17.22, p < .01). 


EXPERIMENT Il 


The aim of the second experiment 
was to explore further the differences 
observed in visual masking when 
either a pattern or a flash was used as 
masking stimuli. In an earlier study 
(Schiller & Wiener, 1963) there was 
evidence that with a pattern as a 
masking stimulus there were decre- 
ments in recognition threshold with 
practice. ls this practice effect com- 
mon to all masking situations or does 
it occur only with certain kinds of 
stimulus conditions? 
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Method 


Subjects —The Ss were eight paid volun- 
teers with equal visual acuity in both eyes. 
These were naive Ss who had not previously 
participated in a visual masking experiment. 
The Ss were divided in two groups. 

Procedure.—The same apparatus and ma- 
terials as in Exp. I were employed. All Ss 
participated in eight testing sessions. During 
each of these sessions, 10 threshold measure- 
ments were taken for each of five conditions. 
These were the same as in Exp. I. no masking 
(N), monoptic flash (FM), dichoptic flash 
(FD), monoptic pattern (PM), and dichoptic 
pattern (PD). Five successive threshold 
measurements were taken for each condition, 
with each condition appearing twice during a 
session. Orders of conditions and test stimuli 
were random. The general procedure was 
identical to the one used in Exp. I. 

For Group I, ISI was constant at 3 msec. 
For Group II, the ISI was always 20 msec. 


Results 


Since all the Ss showed the same 
trends, only the combined data are 
presented here. Figures 3 and 4 show 
the mean recognition thresholds for 
the two groups for the 8 days of 
testing. With an ISI of 3 msec., the 
SDs for the PD and PM conditions 
averaged 4-9 msec., and for the FD, 
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FM, and N conditions, 2-7 msec. 
With an ISI of 20 msec., the SDs for 
the PD and PM conditions averaged 
2-6 msec., and for the FD, FM, and 
N conditions, 2-4 msec. The SDs in 
this experiment are higher than in 
Exp. I; this is probably due to the 
fact that naive Ss participated in this 
experiment. 

The results show that decrement in 
threshold with the pattern was much 
greater than the decrement with the 
flash. Table 1 shows the mean im- 
provement in percent from Days 1 to 8 
for all conditions, 

The extent of masking by the 
pattern was somewhat greater mon- 
optically than dichoptically. This 
difference was significant for seven of 
the eight Ss according to ¢ tests 
(t = 8.40, 6.74, 6.50, 3.71, 3.59, 3.46, 
2.66, .85) for the overall means. 

The extent of masking by the pat- 
tern was less with an ISI of 20 msec. 
than with an ISI of 3 msec. This 
difference was much greater for the 
monoptically presented flash. This 
finding confirms those of Exp. I with 
respect to the greater rate of decre- 
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ment of masking by flashes as ISI is 
Increased. 


Experiment III 


In the third experiment various 
combinations of flash and pattern 
were employed in the masking situa- 
tion in a further attempt to assess the 
Processes underlying masking and the 
relationship between the two kinds of 
Masking situations. 


Method 


Subjects —Two Ss served in this st 
Oth were hi ined 

ghly trained. ae 
t B rocedure.—The same apparatus and ES 
rom were used. The Ss participated in va 
esting sessions, A constant ISI of 3 msec. 


udy; 


TABLE 1 
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for the eight sessions with an ISI of 20 msec.: Exp. II. 


was employed. There were nine conditions in 
this study. The first five were similar to the 
conditions used in Exp. Land II. (Cond. 1) N, 
(Cond. 2) FM, (Cond. 3) FD, (Cond. 4) PM, 
and (Cond. 5) PD. Conditions 6-9 employed 
various pairings of flash and pattern as 
follows: (Cond 6) FPM. The flash and pat- 
tern appeared together ; both were presented 
to the same eye as the test stimulus. (Cond. 
7) FPD. The flash and pattern appeared 
together ; both were presented opposite to the 
eye which received the test stimulus. (Cond. 
8) pD-FM. The flash was presented to the 
same eye as the test stimulus; at the same 
time, the pattern was exposed to the other eye. 
(Cond. 9) PM-FD. The pattern was pre- 
ed to the same eye as the test stimulus; 


sent 
at the same time, the flash was exposed to the 


other eye. 
Presenting the flash and pattern together 
FPM and FPD) had the effect of increasing 
the contrast of the pattern, since the more 
intense light sources were now reflected not 
from a blank card as in Cond. FM and FD 
but from the card containing the pattern. 


Results 

The results of this experiment are 
reported in Fig. 5. Since both Ss 
showed the same trends, their data 
were combined. The differences 
among conditions were analyzed by 
i tests. All comparisons yielded sig- 
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ion thresholds 


in milliseconds 


mean recogni 


Fie. 5. 


nificant differences at the .01 level for 
both Ss except for the following: FM 
and PM, FM and PM-FD, PM and 
PM-FD. The difference between N 
and FD was significant for only one 
of the Ss. Conditions 1-5 produced 
results similar to those found in Exp, 
land II. These were essentially con- 
trols to be compared with Cond. 6-9. 
Presenting flash and pattern together 
to one eye (Cond. FPM and FPD) 
significantly increased the masking 
effect. This occurred upon monoptic 
and dichoptic presentation, but mask- 
ing on monoptic presentation was 
significantly greater than on dichoptic. 

If the flash interferes with percep- 
tion of the test stimulus at a stage 
prior to that at which the two 
monocular fields are combined, it 
would be reasonable to assume that 
under the FM-PD condition the 
perceiver has to overcome first the 
interference due to the flash given to 
the same eye, and then the interfer- 
ence due to the pattern given to the 
other eye. Accordingly, a higher 
recognition threshold should be ob- 
tained with the FM-PD condition 
than with the FM and PD conditions. 
The results bear out this expectation. 
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Mean recognition thresholds for the conditions of Exp. III with an ISI of 3 msec. 


Discussion 


The results of this study show the 
following: 

1. Backward masking by a flash of 
light is primarily a monoptic effect; 
masking by pattern occurs under mon- 
optic and dichoptic conditions. 

2. Both masking by pattern and mask- 
ing by monoptically presented flash de- 
crease with increasing ISI. The rate of 
this decrease is greater for the flash. 

3. Repeated trials with patterns as 
masking stimuli lead to a decrement in 
the masking effect. Such “practice 
effects” are not obtained for flashes as 
masking stimuli. 

4. Pairing the flash and the pattern as 
masking stimuli produces an increase in 
the masking effect under both monoptic 
and dichoptic conditions as compared 


with effects of either flash or of pattern 
presented alone. 


5. Presenting the flash to one eye and 
the pattern to the other increases their 
combined masking effect, provided the 
flash is given under monoptic conditions 
but not if the flash is given under 
dichoptic conditions, 

These results suggest that there are 
two distinguishable processes of masking 
involved in this study. One of these (for 
flash) seems to occur in the central visual 
pathways prior to the mixing of the two 
monocular representations, The other 
(for pattern) occurs at levels where the 
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two monocular fields interact; the decline 
in masking by pattern on repeated trials 
further suggests complex processes of 
perception in which learning might play 
arole. The type of effect observed with 
the flash seems to be related to the 
interference effects reported by Boynton 
and Kandel (1957), who were apparently 
the first to introduce the term “masking” 
for this phenomenon. 

The visual masking effects reported 
here seem to be different from the inter- 
ference which takes place by metacon- 
trast (Alpern, 1953). In metacontrast 
interference with the first stimulus is 
minimal at short ISIs, reaches à maxi- 
mum between 75-100 msec., and then 
declines again. In the masking experi- 
ments reported here, interference de- 
clined steadily with increasing ISI. 

The results reported here show a small 
degree of masking with a dichoptic flash. 
The extent of this masking varied to some 
degree with Ss, and two practiced Ss did 
Not show a significant effect at all. On 
the basis of such variability it is difficult 
to draw any conclusions from these find- 
ings. It should be noted, however that 
small but reliable dichoptic effects, USINS 

ashes of light for both the masked and 
Being stimuli, have been reported by 
Kitersby and Wagman (1962) and 

andel (1958). 
ae extent of masking by pattern was 

ightly less dichoptically than mon- 
OPtically, This difference can apparently 

* magnified when the monoptic an 
rachoptie conditions, rather than being 
B ndomized as in this study, af Bre 
ented in separate series of trials (Schiller 
wl It is possible that 
When these conditions are not random- 
n under 
Nen conditions | slighty 
crea, ited with practices there y 

Sing the extent of masking- 
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APPLICATION OF A MARKOV MODEL TO FREE 


RECALL AND RECOGNITION ! 


WALTER KINTSCH anp CHARLES J. MORRIS 
University of Missouri 


In 2 experiments lists of nonsense syllables were learned by the methods 
of recognition and recall, In a 3rd experiment Ss first learned to 
recognize a list of nonsense syllables and then to recall it. The recogni- 
tion data could be described by a simple 2-state Markov model. A 
3-state Markov model was needed for the description of the recall data. 
It was hypothesized that recall learning involved 2 stages, a recognition 
stage and a 2nd stage where the response becomes available in the 
absence of the stimulus, and that each of these stages can be described 


as a simple Markov process. 


In Experiment III it was shown that re- 


call learning after recognition learning can actually be described by a 


2-state Markov model. 


The results of many studies con- 
ducted with the method of free recall 
and recognition permit the general- 
ization that recognition is a simpler 
process than recall. The low incidence 
of extralist intrusions in studies of 
free recall provides indirect evidence 
that recognition of list membership 
develops faster than recall (Deese, 
1959). When recognition and recall 
are employed as measures of reten- 
tion the former yields higher scores 
than tests of recall (Postman & Rau, 
1957). Apparently an S first learns 
to recognize an item and then he 
learns to recall it. Thus, learning by 
the method of free recall can be re- 
garded as a two-stage process. If one 
is willing to assume that learning in 
each stage is all-or-none, a simple 
model can be obtained for this two- 
stage process. This assumption ap- 
pears justifiable because learning in 
each stage is elementary, well-struc- 
tured, and well-controlled and because 
the application of Markov models to 
such learning tasks has been notably 
successful (Bower, 1961, 1962; Bower 
& Trabasso, 1963; Estes, 1960, 1961; 
Suppes & Ginsberg, 1963). 

1This study was supported by Research 
Grant GB-195 from the National Science 
Foundation. 


The strategy to be followed here 
consists in analyzing free recall into 
two one-stage learning processes. The 
model used to describe the elementary 
learning process is a simple two-state 
Markov model of the type used by 
Bower (1961). This model assumes 
a constant probability on every learn- 
ing trial that an item will move from 
the starting state to the learning state. 
The learning state of the first sub- 
process will be the starting state for 
the second subprocess. The proba- 
bility of a correct response in each 
state will depend both upon E's 
procedure and the response biases of 
S. In general, subprocesses will have 
different learning rates. 

Using the conventional notation 
for Markov processes the model for 
recognition learning is given by 


Probability 
of a correct 
Co Ci response 


Co [ (l—c) c 1—5 
Ci 0 1 1 

This model is identical with Bower's 
paired-associate model, except for the 
trivial modification that the process 


always starts in State C, in Bower's 
case, whereas in the typical recogni- 


[1] 
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tion experiment data are collected 
only after one learning trial has been 
given. Thus the expressions given 
by Bower (1961) must be adjusted 
slightly. They are reproduced in the 
Appendix. 

Free recall will be represented by a 
three-state Markov process. The sub- 
process of recognition is again de- 
scribed by Matrix 1. However, the 
probabilities of a correct response in 
Co and C; will be zero and (1 — 7) in 
this case. State Ci is the learning 
state for the recognition subprocess 
and the initial state for the recall sub- 
process. In this state an item can be 
recognized, but active reproduction 15 
not always possible. The recall 
process itself is also described by à 
transition matrix like Matrix 1, which 


gives as a total process 


Probability 
of a correct 
Co CG C response 
C[(1—-) c 0 0 
G| o (1-6) ? | i-r 

Cy 0 4 1 

[2] 

This three-state model is similar to 


the one used by Bower and Theios 
except 


(1964) and Kintsch (1963), 
hat allowance was made there for the 
oui that the learnin 
e second stage might be 
After a failure than after @ success: 
leon refinement was abandoned here 
or the sake of simplicity- 
Theoretical predictions from 
dee given in the Appendix. 
€ identical with the ones E 
Ower and Theios (19 
€ modification noted abo 


ven by 


t ; 
Mese expressions are COM sed wits 
experimental results, E 

missed 


Pa 

will on which all items are 
be included in the data- 

ide data 


fo Experiments 1 and II prov! T 
T a test of the recognition and reca 
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model. In Exp. lll the learning 
process is experimentally subdivided 
into a recognition and a recall phase. 


METHOD 


Subjects.—Fifty students from an intro- 
ductory psychology course served as Ss, 16 
each in Exp. I and II, and 20 in Exp. IH. The 
Ss were randomly assigned to the three ex- 
perimental groups. 

Materials.—Sixty nonsense syllables were 
selected from Noble's (1961) list. The 
association value of all syllables was between 
5 and .6. Each syllable was printed on a 
3 X 5 in. index card. 


Procedure 

Experiment I: Recognition.—Each S re- 
ceived a list of 15 nonsense syllables which 
were selected randomly from the pool de- 
scribed above. Each syllable was presented 
to S for 2 sec. After all syllables had been 
presented the cards were shuffled together 
with 15 syllables from the same pool which 
had not been shown before and were presented 
again. The S was now instructed to respond 
with “yes” or “no” to each card, depending 
upon whether he thought he had seen it 
before or not. This procedure was repeated 
until S reached a criterion of three successive 
correct recognition trials. All Ss were in- 
structed to spell out the syllables aloud as 
they were shown. The E recorded for each of 
the 15 original items whether it had been 
recognized. 

Experiment. II: Recall.— The Ss were given 
10-item lists to learn at the rate of approxi- 
mately one item every 2 sec. After each 

resentation S was given up to 120 sec. during 
which he attempted to recall the items 
verbally. Again, a criterion of three succes- 
sive correct trials was required. 

Experiment III: Recognition followed by 
recall.—Ten-item lists were employed in this 
experiment. First the list was learned to a 
criterion of three successive correct trials with 
the procedure of Exp. I The Ss were then 
instructed to try to recall as many items as 
they could. Further trials were given, follow- 
ing the procedure of Exp. II, until a criterion 
of three successive correct recalls was reached. 


RESULTS 


of learning stages.*—Ac- 


Analysis 
he model given by Ma- 


cording to t 
?Failures to recall or recognize will be 
referred to as errors. 
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trices 1 and 2 different stages of 
learning are characterized by constant 
response probabilities. In Exp. I the 
probability of a correct response 
should not change as long as an item 
is in State Co. Since an item is in the 
starting state at least until the trial of 
the last error, response probability 
should be constant on trials before the 
last error. To test this proposition 
sequences before the last error were 
divided into thirds and the errors in 
each third were recorded. They were 
8, 6, and 9, respectively, out of a total 
number of 46 observations in each 
third. A chi-square test for the sta- 
tionarity of this Vincent curve yielded 
a value of .786 which was obviously 
not significant. 

In Exp. II stationarity was expected 
in the intermediary state according to 
Matrix 2. Again it was impossible 
to know on exactly which trials the 
process was in the intermediary state, 
but all trials between the first success 
and the last error must belong to this 
state. Thus Vincentized sequences 
over these trials were examined for 
stationarity. Halves were used in- 
stead of thirds this time because not 
enough long sequences were available. 
The number of errors in the two halves 
were 21 and 20 out of 59 observations, 
with a corresponding nonsignificant 
chi square of .37. "This aspect of the 
recall data is therefore in agreement 
with the hypothesis that recall is a 
two-stage process. To show that re- 
call could not be described as a one- 
stage process, a Vincent curve was 
constructed in the manner of Exp. I. 
The hypothesis of stationarity had to 
be rejected for this curve. The pro- 

portion of errors was .61 and .43 in the 
two halves, x? = 9.349, p « .01. 

In Exp. III the two stages of recall 
were separated experimentally. Per- 
formance before the last error should 
therefore be stationary in each phase, 
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as it was in Exp. I. Learning in the 
recognition phase was, however, so 
fast that no meaningful tests of 
stationarity could be performed. In 
the recall phase the numbers of errors 
on trials previous to the last error 
were 17 and 20 for the first and second 
halves of these trials (out of 33 ob- 
servations in each half). The number 
of errors thus actually increased from 
the first to the second half. The dif- 
ference was, however, not statistically 
significant (x? — .553). 

The stationarity data are presented 
here as evidence that a Markov model 
can be used to describe recognition 
and recall learning. They are not in- 
tended to prove that learning was 
all-or-none or that no other interpre- 
tations are feasible, but merely to 
show that the very simple conceptual- 
ization in terms of Markov processes 
is in agreement with this aspect of the 
data. Another requirement of the 
Markov model, namely that trials 
before the last error be independent, 
was also met by the data. Chi-square 
tests of independence of successive 
trials yielded a value of 2.66 for Exp. 
I and .87 for the intermediary trials 
in Exp. II, both with df —1. In 
Exp. III a chi square of .44 was ob- 
tained for the recall phase (df = 1), 
while not enough data were available 
for a statistical test in the recognition 
phase. 


Fit of the Model 


Experiment I.—Further compari- 
sons of the model with the recognition 
data necessitated the estimation of the 
two parameters of the model. An 
intuitively reasonable estimate for P 
was available in the proportion of 
errors on trials before the last error, 
P = .214. An estimate for the condi- 
tioning parameter was obtained by 
setting the theoretical expression for 
the total number of errors equal to the 
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observed total number of errors, sub- 
stituting for p and solving for ¢. The 
value obtained in this manner was 
é = .282. The predictions from the 
model were obtained by a simple 
modification of the expressions given 
by Bower (1961). These are included 
in the Appendix. Using the estimates 
p and c, numerical predictions for 
various statistics of the data were 
then calculated. In general the fit 


between these predictions and the 
data was satisfactory. A few com- 
parisons are given in Table 1. 


Experiment II.—1n order to derive 
predictions from the recall model, the 
estimation of three parameters was 


TABLE 1 
PREDICTIONS AND OBSERVED STATI 
IN Expr. I 


STICS 


Observed Predicted 
pausan 


Mea 5 | x 
an number of errors E .92 
Mean trials before the first 8 

recognition 2 46 
Mean tri S au m 
"SD rial of last error 244 | 245 
Total number of error runs | -46 = 


Necessary, The proportion of errors 
On trials between the first recall and 
the last error served as a? estimate 
for» ¢ = .333. From Matrix 2 an 
expression was derived for the number 
Of times a success is followed by an 
€tror on the next trial. This expres 
Sion depends only upon 9 and upon r 

Y substituting ? into the theoretica 
€xpression and setting it equal to it$ 
Observed value, an equation was ob- 


tained and solved for 8. The obtaine 
estimate wae @ = .297. The estimate 
s in Exp- ls 


d € was then obtained à 
eng D and 6. It was fune Fs 
iu. -502. These estimates vee z 
: Obtain the predictions which a 
Ompared with the data in Table 2. 
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TABLE 2 
PREDICTIONS AND OBSERVED STATISTICS 
IN Exp. II 


Observed | Predicted 


| 


Mean number of errors 
SD 
Mean number of trials be- 
fore the first recall 


mo 
aH 
pue 
E 
- 


Mean trial of last error 
SD 
Total number of error runs 


gagian ES 
tno des 
NOEAS 
LACIE SNS 
Moku. 
$9295 2 


Experiment. III.—Phase I of this 
experiment was analyzed exactly like 
the recognition data in Exp. I. The 
estimation procedure yielded $ = .178 
and 6 = .372. Table 3 provides a 
comparison between observed and 
predicted statistics from which the 
goodness of fit of the model can be 
judged. 

The one-stage model applied to the 
recall phase of this experiment was 
somewhat different from the model 
used to describe recognition data. At 
the end of recognition learning several 
items were already available for recall, 
in spite of the fact that S was not 
instructed to recall items or was pre- 
viously tested for recall. The propor- 
tion of such items (A) was therefore 
another parameter of the one-stage 
model. The theoretical expressions 
derived from this modified model are 


TABLE 3 
PREDICTIONS AND OBSERVED STATISTICS 
1x RECOGNITION PHASE 
ix Exp. HI 


Observed | Predicted 


Mean number of errors 30 = 

SD : .56 .62 
Mean number of trials be- 
fore the first recogni- 


tion AT A3 

SD E 37 
Mean trial of last error 54 62 
D 1.12 1.52 
‘Total number of error runs .23 27 
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TABLE 4 


PREDICTIONS AND OBSERVED STATISTICS: 
RECALL PuasE oF Exp. III 


Observed | Predicted 


M ber or errors .62 — 
: E] M 1.00 1.01 
Mean number of trials be- 
fore the first recall 42 — 
D .80 -18 
Mean trial of last error .83 .81 
SD 1.45 1.39 
Total number of error runs 42 A3 


also included in the Appendix. A 
simple estimate for the proportion A 
and the conditioning parameter c was 
obtained by solving the two equations 
for the total number of errors and the 
number of errors before the first recall 
for c and X. In order to do this an 
estimate for p was first obtained in 
the usual manner. The resulting 
parameter estimates were $ = BUT, 
À = 487, and 6 = 474, A compari- 
son of the predictions obtained from 
this model with the data of the recall 
phase is presented in Table 4. It is 
interesting to note that the number of 
items available for recall at the be- 
ginning of Phase II was approximately 
equal to those learned in one regular 
recall trial. 

In applications of the model it was 
assumed that all items and all Ss were 
equivalent. Experiment IT] provides 
a crude test of this assumption, at 
least as far as the equivalence of Ss 
is concerned. If individual differ- 
ences were significant in this experi- 
ment slow learners in Phase I should 
also be slow learners in Phase II. 
According to the model, on the other 
hand, slow and fast learning of an 
item was regarded as a random process 
governed by one and the same condi- 
tioning rate. The correlation between 
number of trials to criterion in Phases 
I and II should therefore be zero. A 
rank-order correlation of .261 was 
actually obtained, which was not sig- 
nificantly different from zero. 
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Discussion 


The results of this experiment support 
the hypothesis that learning by the 
method of recall is a two-stage process, 
with learning to recognize as a first 
stage. Secondly, the stationarity data 
reported and the successful fit of the 
model demonstrate that it is possible to 
describe each stage as a two-state 
Markov process. Concerning this second 
point some experience from other experi- 
ments and some speculations must be 
reported. It would be quite easy to 
destroy the all-or-none character of the 
data. This can be achieved by using 
learning material which is so complex 
that even recognition learning involves 
more than one step. Low-association- 
value consonant trigrams will do for that 
Purpose. Also, the application of the 
present model is precluded whenever E 
unbalances the equivalence of the items 
to be learned. This can be done by al- 
lowing some items to be very different 
from others, by presenting the items 
always in the same order, or by any other 
means which ensure that some items will 
arouse more attention than others. 
What happens then is that certain items 
are set aside for a few trials while S 
concentrates, for example, on the first 
and last items of the list. Increase in list 
length might have the same effect. With 
long lists Ss apparently do not divide 
their attention evenly over all items, but 
single out some for learning and leave the 
others for later. A one-stage model 
obviously would be inadequate to de- 
scribe the learning of the latter items. 

The question of whether the recall 
learning in Exp. II or the recognition and 
recall learning in Exp. III was more 
efficient can not be answered unequiv- 
ocally. In terms of total number of 
errors made, the procedure used in Exp. 
IIT was much superior, However, such a 
direct Comparison is inadmissible be- 
cause the experimental operations were 
such as to ensure errors in the first stage 
in Exp. II but not during the same stage 
in Exp. III. comparison in terms of 
the total number of trials to complete 
learning in Exp. II and III revealed no 
difference. The mean number of trials 
to criterion was 9.4 and 11.1 in the two 
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experiments, £ (34) = 1.90, p > .05. It 
should be noted, however, that the score 
for Exp. III includes a total of six 
criterion trials (three successive correct 
trials in both phases of the experiment). 
Only three criterion trials are included in 
the score for Exp. II. 

The two stages which are postulated 
by the present model are recognition 
learning and recall. Recognition learn- 
ing consists in differentiating the items of 
a list, i.e., of encoding the stimulus. 
This encoding process establishes a re- 
sponse to an item when it is presented. 
In the beginning of the second phase the 
response is thus elicited by the sensory 
Stimulus, but it is not reliably available 
in the absence of the sensory stimulus. 
The second learning phase consists in 
making the response always available. 
This interpretation is in agreement with 
the different functions of recall and 
recognition as described by Postman and 
Rau (1957). It also has certain features 
in common with a stochastic model for 
free recall described by Waugh and Smith 
(1962). Corresponding to the two stages 
Which were identified here, Waugh dis- 
tinguishes between processes of labeling 
and fixation. Her model is also a Markov 
model, but somewhat more complex than 

he present model. The most important 

iflerence lies in the fact that Waugh 
allows for direct transitions from the 
Starting state to the learning state. This 
is incompatible with the idea of analyzing 
recall into a sequence of two one-step 
Processes, 

nae model for free recall by 
1955 ill (1952), (also Bush & ere 
i ) differs from the present mo 
Dterpretation as well as formal structure. 

* two models do, however, lead to 
Some common predictions. For instance, 

€Y predict a geometric Or nearly 
8cometric distribution of trials before the 

TSt recall (see also Murdock & Babick, 
1961) 

Although the present model accounts 
Pr some aspects of free-recall data, it 
Th not give a complete description. 
xt Persistent clustering of x m 

“recall data is the most SIE! 
Spect of the data which has been 
Neglected, 


Miller and 
Mosteller, 
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APPENDIX 
THEORETICAL PREDICTIONS 


A. Some derivations from the two-state 
model when observations are taken only 
after the first learning trial: 
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Distribution of number of errors per 
item, T: 


ET) = [e E c)b 
Var(T) = (E(T))? + E(T) 


Distribution of number of trials before 
the first success, J: 


b — e) 
1—p(1 =g) 

Var(J) = (E(J)} + E(J) 
Distribution of trial of last error, N: 


p — c) 
c(1 — (1 —)(1 — 2) 


Var(N) = E(N) 2e - (E(N)} 


EQ) = 


E(N) = 


Total number of error runs, R: 
E(R) = E(T)(1 — (1 — 9p). 


B. Derivations from the same model 
when already a proportion À of the items 
are in the learning state on Trial 1: 


Distribution of total number of errors 
per item, T: 


E(T) = eo) 


Var(T) = E(T) i — (E(T)} 
poo — 
1— (1 — c)(1 — p) 


Distribution of number of trials before 
the first success, J: 


i 
MUNI ET 
2 = 
B. =H | 
= BUI 
Distribution of trial of last error, N: 
pU —») 
e[t — (1— à — p)} 


Var(N) = E(N) = - (any 


Var(J) = E(J) 


E(N) 


Total number of error runs, R: 


E(R) = E(T)0 — (1 — 95. 


C. Derivations from the 
model: 


three-state 


Distribution of the total number of 
errors per item, T: 
1 r 
E(T)---t ó 
2—c 
c 
4: cb(1 — (1 — r)b) 
c—b 


2—b 2—c 
xaz- ts] 


— (ET) 


Var(T) = (1 — r)b 


tox cl. cam 
1-0-0907) 


Distribution of number of trials before 
the first success, J: 


E r 
EMEN S dar rdi 
2—c 

ce 


where b = 


Var(J) = (1 — r) 


cr(1 — a) 
traic 


l+a 2—c 
x{ ath 35! 


= (E(J))* 


where a = r(1 — 0) 


Distribution of trial of last error, N: 


zh r 
EUN) es c Tk (1 — 7)0) 
Var(N) = (1 - l s 
2—c 2—0 
x| eo | 


—(E(N)y 
Total number of error runs, R: 
r 
EQR)-g0—0-05»40-2 


Average number of times a success is 
followed by an error one trial later, d: 


zy oL fü - 00-96) 
MENS mM 
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SUPPLEMENTARY REPORT 


"FATE" OF LIST 1 R-S ASSOCIATIONS IN TRANSFER THEORY ! 


NORMA R. ELLINGTON anp DONALD H. KAUSLER 
St. Louis University 


Extinction of List 1 R-S associations during practice on List 2 was 
tested for the A-B, C-B transfer paradigm by means of the Barnes and 


Underwood (1959) modified recall method. 


It was found that both the 


R-S associations of List 1 and the stimuli themselves became increasingly 


unavailable for recall with increasing practice on List 2. 
2, however, became increa 


ciations and stimuli of List 


for recall with practice on List 2. A 
practice, indicated that the decrement 1n R- 1 
le forgetting occurring during List 2 practice. 


be attributed to simp 


Evidence for the extinction of List 1 A-B 
associations during the learning of List 2 A-C 
associations has been presente by_ both 
Barnes and Underwood (1959) and Goggin 
(1963). A similar phenomenon might be ex- 
Pected for the R-S associations of List 1 pairs 
in an A-B, C-B transfer paradigm in that the 
R-S associations of the two lists enter into an 
A-B, A-C relationship (Kausler & Kanoti, 
1963; Twedt & Underwood, 1959). In fact, 
Some evidence for the extinction of B-A (i.e. 
List 1 R-S associations) associations following 
Practice on List 2 C-B (S-R) pairs has Dem 
Presented by Keppel and Underwood (1962). 
tes Present study extended this area =| 
U estigation by replicating the Barnes an 
ee experiment on the C-B paradigm, 
x the extinction of B-A associations being 
ured by their modified free recall tech- 

ique after varying degrees of List 2 practice. 
dt Method.—Seventy-five Ss (undergraduate 
students in psychology) wert assigned alter- 
ately to four experimental groups and à 


y: * Deua, toa criterion of on 
wine ADM imental groups s, 
or 20 5 a familiarization trial) eithe 
Sibu trials on List 2 (C-B pairs). j 
solvi performed on a perceptua re 
fies task, in lieu of List 2, for a P f 
grou approximating that spent by 
Presented e E: Ecc Le 
i g :2-sec. rate, e 
mde el interval, “Three different serial 
Srs were employed for each list. 
sof lowing List 7 practice (or the problem- 
of bak task), R-S recall for the associations 
ioe lists was measured by presentine : 
à blank sheet of paper anda second sheet, 
frs This study is based on a thesis submitted DY ty 


uthor to th ‘School, St. 
Partial fulfillncent tor the "Master of Arts degree. 


The R-S asso- 
singly available 
A control group, without List 2 
S recall on List 1 could not 


which was available throughout the R-S recall, 
containing the responses of the two lists in 
alphabetical order. The Ss were instructed 
to write down all of the stimuli from the two 
lists they could remember and in whatever 
order they came to mind. No time limit was 
imposed for this task. They were then in- 
structed to write beside each stimulus the 
response with which it had been paired and 
also to indicate whether the stimulus came 
from List 1 or 2 (again without a time limit). 
Following this, Ss were told they could add as 
many other stimuli as they remembered at 
that time, even if they were not absolutely 
sure of all of the letters. The R-S recall data 
were therefore comparable to the Method 1 
and Method 2 S-R data of Goggin's (1963) 
study. 

The stimuli for the two lists were nonsense 
syllables selected from the 60 to 73% range 
of Archer's (1960) standardization, with both 
intra- and interlist. similarity minimized as 
much as possible. 'The response terms were 
selected. from the adjectives employed by 
Barnes and Underwood. 

Results. —For all five groups combined, the 
mean number of trials to criterion on List 1 
was 11.48. Comparability among the five 
groups in List 1 performance is attested to by 
the lack of a statistically significant difference 
between the means for trials to criterion, 
F (4, 10) = 1.01, p > -25. R-S recall scores 
were considered correct in Method 1 (after 
Goggin, 1963) if the stimulus term was paired 
with the right response term and given the 
correct list number. In Method 2 (after 
Goggin) the stimulus was counted correct if it 
was recalled, whether or not the response with 
which it was paired and the list number were 
remembered. Summary data for the four 
experimental groups on both lists and both 
methods are given in Table 1. 
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TABLE 1 
SUMMARY DATA FOR R-S RECALL ON LISTS 1 AND 2 


Trials on List 2 


Method 1 5 10 20 
M SD M SD M SD M SD 
1 4 2 
i 1.64 1.93 1.62 2.07 2.19 1.40 1.1 
Lx 2 213 1.92 4.33 2.77 4.27 1.67 6.80 1.47 
i S 1.50 2.46 1.92 2.40 1.99 1.73 1.44 
rit i $n 2.05 5.27 2.40 5.13 1.96 6.87 1.12 


From Table 1 it is apparent that, with 
practice on List 2, not only are the R-S 
associations of List 1 extinguished, but also 
that the List 1 stimuli themselves become un- 
available for recall. The analyses of variance 
for List 1 revealed a significant group effect 
on both recall scores—for Method 1, F (3, 56) 
= 7.53, p < .001; for Method 2, F (3, 56) 
= 6.60, p < .001. It is also apparent that, 
with increasing practice on List 2, both the 
stimuli and R-S associations of List 2 become 
increasingly available for recall. Analyses of 
variance again revealed a significant group 
effect on both recall scores—for Method 1, 
F (3, 56) = 13.39, p < .001; for Method 2, 
F (3, 56) = 8.62, p < .001. 

The control group checked the possibility 
that the decline in List 1 R-S scores could be 
attributed to simple forgetting. The mean 
score on both methods for this group was 5.73, 
a value significantly higher than the 20-trial 
experimental group—t’s (28) of 8.45 and 7.71 
for Methods 1 and 2, respectively—indicating 
that the decrement in recall of B-A associa- 
tions cannot be attributed to forgetting, 
but rather to the result of learning B-C 
associations. 

Interpretation of the List 1 data for both 
Methods 1 and 2, however, is complicated by 
the presence of possibly confounding factors. 
Method 1, for example, may be measuring 
both degree of R-S extinction and the inability 
to differentiate between lists. That is, R-S 
associations required both recall and list 
identification in order to be scored as correct 
by this method. In an attempt to eliminate 
this differentiation factor, a corrected Method 
1 score was computed for each S in the experi- 
mental groups by adding to his score all 
associations correctly recalled but placed in 
the wrong list. The lack of a pronounced list- 

differentiation effect, however, is indicated by 
the relatively slight change in means, with the 
corrected Method 1 means being 4.13, 2.00, 
2.13, and 1.53 for the 1-, 5-, 10-, and 20-trial 


groups, respectively (see Table 1 for the un- 
corrected means). Moreover, increasing ex- 
tinction with practice increments is still 
apparent from the statistically significant 
group effect, F (3, 56) = 6.56, p « .001. 

Similarly, Method 2, as an index of stim- 
ulus availability, is limited by the procedure 
employed. To be scored as correct by this 
method, a stimulus had merely to be named, 
regardless of response pairing and list identifi- 
cation. The naming of stimuli under these 
conditions may be confined to those stimuli 
for which there are intact associations, unless 
S has other stimuli available for which he 
attempts to guess the response pairing and list 
membership. A guessing set was not in- 
tentionally generated in this study, and it is 
conceivable that Method 2 underestimates 
stimulus availability under these conditions. 
Consequently, it is difficult to estimate the 
degree of extinction of R-S associations be- 
yond that involved in the unavailability of the 
stimuli per se. Nevertheless, it should be 
noted that for all four experimental groups 
the corrected Method 1 means are below the 
Method 2 means and that for two of the four 
groups the difference in means reached 
statistical significance—p < .05; correlated 
t's (14) = 1.80, 2.47, 2.29, and 1.39 for 1, 5, 
10, and 20 trials, respectively. 
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MOMENTS OF AREA AND OF THE PERIMETER OF VISUAL 
FORM AS PREDICTORS OF DISCRIMINATION 
PERFORMANCE * 


LEONARD ZUSNE? 


Purdue University 


2 experiments were performed to evaluate the usefulness of the 2nd, 
3rd, and 4th moments of distribution of the area and of the perimeter 
of random shapes as predictors of performance in a discrimination task. 
Repeated measures of RT were taken on 44 Ss who responded to 80 
6-choice oddity-tyPe problems defined by the presence or absence of a 
difference in the 3 moments. Both moments of area and of the perim- 
eter proved to be significant (p < 01) predictors of performance. 
Computer method for the computation of moments is presented, and the 
difference between the 2 types of moments, methodological problems, 


their solution, and the advantages of moments as compared to other 


form measures are discussed. 


Bier ies 10 and 80 physical meas- much less to the establishment of a 
nd, of visual form have been defined few basic parameters. , 
il used in form-perception experi- _ Four of these measures —informa- 
Tents since 1948 when the formula- tion content, linearity of the contour, 
tion of information theory gave im- area, and rotation—belong to one 
Petus to the development of metrics of class. They differ from the rest in 
visual form, With a few notable ex- that they do not directly describe the 
Ceptions, this profusion of measures manner in which form contours are 
as not led to a definition of the limits distributed in space. Such a descrip- 
of the multidimensionality of form, tion has been the main difficulty so 
i far, and it accounts for the large 
number of measures that make up 


ted in part 
roup. Among these, 
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1960; Attneave, 1957; Bitterman, 
Krauskopf, & Hochberg, 1954; Cas- 
person, 1950; Fitts, Weinstein, Rap- 
paport, Anderson, & Leonard, 1956; 
Lamar, Hecht, Shlaer, & Hendley, 
1947; Zusne & Michels, 1962a, 1962b). 
It appears now that the definitions of 
these three parameters can be im- 
proved, indeed, that these and other 
related measures may all be brought 
to a common denominator. 

In statistics, a unidimensional sam- 
pling distribution is specified by 
stating its mean, variance, skewness, 
kurtosis, and other higher moments. 
The area of a shape may be thought 
of as a two-dimensional distribution of 
its elements, i.e., small units of area. 
Such a distribution may be specified 
by stating its moments of area. When 
it is realized that the three physical 
form measures just mentioned are 
related to the second (compactness 
— variance), third (skewness — sym- 
metry), and fourth (kurtosis = elon- 
gation) moments of a sampling distri- 
bution, the investigation of moments 
of area acquires additional interest. 

Alt (1962) appears to be the only 
investigator to have used moments of 
area to specify two-dimensional dis- 
tributions, but his work does not deal 
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with the behavioral correlates of 
moments. The overlap measures 
developed by Monty and Boynton 
(1962), while not explicitly recognized 
as such by the authors, are related to 
moments of area. 

A pilot study showed that when 50 
Ss were given the task of drawing a 
straight line through each of 100 
eight-sided random shapes so as to 
maximize the symmetry of the two 
halves of the shape, the location of the 
line coincided or was very close to 
that axis about which the moments of 
area were either at their minimum or 
maximum values. The two experi- 
ments described below were designed 
to further test the utility of moments 
as predictors of performance. 


EXPERIMENT I 
Method. 


Method of computing moments of arca.— The 
computation of moments of area amounts to 
double integration over the rectangle contain- 
ing the shape (two sides of which are the 
coordinate axes). Using the Lebesgue inte- 
gral, for one variable X moments are defined as 


U: = ii (X — Xy4A, 


R 


where k is the order of the moment, R is the 
region within the boundary of the form, X is 
the distance of an incremental area dA from 
the origin, and X is the distance of the 
centroid from the origin. 

Since the computation of moments is à 
very time-consuming task, a computer pro- 
gram was written for this purpose. In general, 
the basic unit of which the various moments 
of area are composed is the distance-area 
product, i.e., a unit area multiplied by its dis- 
tance from a coordinate axis. Where the 
shape is positioned in the coordinate system 
is immaterial for the purposes of computer 
operations. A shape for which moments of 
area are to be computed is divided into small, 
discrete area units. These units are coded by 
1 punches (nonarea by 0 punches) in IBM 
cards, The computer operation amounts to 
summing these unit areas, multiplying them 
by their respective distances from the co- 
ordinate axes, and evaluating the computa- 
tional formulae for the moments. Looking at 
it in another way, the shape is projected unto 


PERIMETER MOMENTS AND DISCRIMINATION 


an axis (unidimensional collapse) and mo- 
ments of distribution are computed on the 
resulting frequency distribution polygon of 
the unit areas of the shape. Rotating the 
shape changes the form of this distribution 
polygon, hence also the values of the com- 
puted moments, The computer program was 
written for the first four moments (later aug- 
mented to first eight) to be computed at each 
5° of rotation of the shape. 

Subjects. —The Ss were 11 male and 11 
female students enrolled in an introductory 
psychology course at Purdue University. 

_ Stimuli and apparatus.—A pool of 100 
eight-sided random shapes was constructed 
according to Method I of Attneave and 
Arnoult (1956). ‘These were the 100 shapes 
used in the previously mentioned pilot study. 
The shapes were equated on area (19.22 sq. 
in.), divided into units of .1-5q- in. area, and 
the first four moments were computed for 
cach shape. The stimuli were selected from 
this pool of shapes, as described below. The 
stimulus shapes were cut from black paper and 
arranged in problems as shown in Fig. 1; ie. 
two shapes, one of which was replicated five 
aues The problems were photographed on 
s -mm. negative film and each film frame was 

ounted as a 2-in. projector slide. 
dem Kodak Carousel slide projector pro- 

cted the slides on a 15-in. square Albanene 
ced Screen mounted in the front wall of a 
sids us booth. The S sat facing the other 
iv of the screen, which was about 23 ft. 
ances n. he booth was equipped with a re- 
ranged i oard having six button switches. ar- 
Proble in the same way as the six shapes in a 
Stand ue Reaction time was recorded by a 
ä risa Electric timer in one-hundredths of 
Project E Located between S’s booth and the 
the n ton was E's control board equipped with 
the Cur. controls for the presentation of 
Bee and for closing the appropriate 
Experimental design.—The high correlation 
Ra. between the second and the fourth 
axis den computed about the perceptual 
tioned symmetry defined in the above-men- 

is pilot study, made it impossible to vary 
aN two parameters independently. How- 

a the function of the second moment (i.e, 
ede of its values computed in successive 
metet through a complete cycle) is sym- 
Beina a about its maximum OF minimum 
of th + in addition, the graph of the x function 
8ra h second moment is a mirror image of the 
NOE of the y function. For this reason the 
MM moments sum to à constant in any 
Pendens of the shape. This rotation-inde- 
Not cut measure of the second moment di 

correlate too highly with either the third 
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or the fourth moments, which determined its 
use in the experiment. 

Statistically, the design was a 2 X 2 X 2 
X 5 factorial design with repeated measures 
taken on all factors. The first three factors 
were: sum of the second moments (second 
moment constant), the third moment about 
the perceptual axis of symmetry, and the 
fourth moment about the same axis. (To 
make moments more concrete, they will be 
provisionally referred to in the following as 
areal dispersion or Da, areal symmetry or Sa 
and areal kurtosis or Ka.) Each of these 
factors occurred as either “present” or “ab- 
sent” in terms of a difference in their values 
between the two shapes that constituted a 
discrimination problem. The fourth factor, 
nested under the first three, was problems 
within the eight experimental conditions. All 
factors were considered to be fixed. 

Forty six-choice, oddity-type discrimina- 
tion problems (5 for each experimental condi- 
tion) plus 6 practice problems were con- 
structed as follows: From the 100 shapes, 
pairs of shapes were selected to represent the 
eight possible combinations of presence or 
absence of a difference in the value of the 
three moments. The size of the difference was 
about one-fourth of the total range of the 
respective moment for the 100 shapes. 

The shapes always appeared with their 
perceptual axes of symmetry in the vertical 
orientation, The location of the “different” 
shape was randomized. Each problem was 
presented once to each S, the order of pres- 
entation also being random, except that the 
six practice problems always preceded the 
series. This order was reversed for one-half 
of the Ss of each sex. 

The Ss were tested individually. The in- 
structions directed S to find the “different” 
shape among the six and to press the button 
on the response board which corresponded to 


he S was informed that his RT 


its location. T. 
would be measured and that he was to 


respond as fast as possible without making an 
error. The Swasalso instructed to useonly his 


writing hand for responding. The E recorded 
the RT and proceeded with the next slide. 


Results 

Unweighted means analysis on sex 
differences showed that this factor did 
not affect the RT significantly. Ex- 
cept for the practice problems, there 
was no learning over blocks of prob- 
lems in any of the 5s. The practice 
problems were not used in the com- 


putations. 
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TABLE 1 
ANALYSIS OF VARIANCE OF Loc (1 + RT) To DISCRIMINATION PROBLEMS IN Exp. I AND II 


! 
Exp. I Exp. H 
Source | adf 
| MS Error F MS Error F 
Between Ss 21 | 
Within Ss 858 | 
Second moment con- | | | 
stant (A) 1 | .04014 .00354 | 11.34** | .07870 .00554 14.21** 
Third moment (B) 1 03536 00446 | 7.93* | .03022 .00329 9.18** 
Fourth moment (C) 1 .00342 .00572 |  .60 .07990 | .00556 | 14.37** 
Probl. within A,B;C;* 4 -00104 00492 21 .00622 | .00405 1.53 
Probl. within A;BiCo 4 102315 00232 9.98** .03756 .00363 10.34** 
Probl. within A;BoC; 4 00511 00254 2.01 03237 00496 6.59** 
Probl. within AiBoCo | 4 .02621 .00452 5.79** | .00745 .00302 2.46 
Probl. within AoBiC; 4 | .10129 .00367 21.809 .00882 .00383 2.30 
Probl. within AoBiCo 4 | .02422 | .00249 9.72** | .05894 | .00586 | 10.06** 
Probl. within AoBoC, 4 .01332 .00409 3.25" .00619 .00222 2.79% 
Probl. within AoBoCo 4 | .22219 | .00420 | 52.90** | .03540 | .00454 | 7.79** 
AXB 1 .01161 -00311 3.73 .02009 .00217 9.26** 
AXC 1 -00236 .00247 .96 .04553 .00588 7.74** 
B XC | 1 | .33097 | .00719 | 46.03** | .00365 | .00411 .80 
AXBXC 1 .00594 .00312 1.90 .08536 .00336 | 25.40** 
^1 = difference present, 0 = difference absent. 
*p «.05. 
**p» <.01. 


A plot of the distribution of RTs 
showed positive skewness. A log 
(1 + RT) transformation was there- 
fore applied to the data. A summary 
analysis of variance of the trans- 
formed RTs appears in Table 1. Two 
main conclusions can be drawn from 
this analysis. First, the presence or 
absence of a difference in either Da 
or Sa significantly affected Ss’ RT 
(p « .01). No such effect appeared 
in the case of Ka. In addition, the 
higher was the order of the moment 
the smaller a proportion of the total 
variation was accounted for by it. 


Second, in six out of the eight experi- 
mental conditions problems signifi- 
cantly affected the results, indicating 
that heterogeneous samples of prob- 
lems had been constructed. 

Of the four interactions only one 
was statistically significant. Analysis 
of simple main effects showed that the 
interaction between Sa and Ka oc- 
curred as follows. Performance was 
best when a difference in only one of 
the moments was present. [n this 
situation Sa was a significantly 
(p < .01) better predictor than Ka. 
When differences in both moments 


‘TABLE 2 


MEAN REACTION TIME (1n SEC.) TO DISCRIMINATION PROBLEMS 
iN EiGHT EXPERIMENTAL CONDITIONS or Exp. I 


Experimental Cond.^ 


pes 4 | ABoCo | ABiCi | ABC: | AiBiCo | AiBoCr | AoBiCo 

| ABoCo | AcBiCr | M 2 I F = 

| e|. T ou 
a 1 3 | 129 | 120 1.19 | 119 

EE 6l | 145 | bB E l a LLL PS UR 3: : 
Mean RT 1 DEM eee 2 
1 — I differ significantly (conservative Newman-Keuls test). 
Note. — Underlined conditions do not differ significant [fourth moment, 1 = difference present, 0 = differ- 


a A = second moment constant, B 


ence absent. 


PERIME 


were present, discrimination deteri- 
orated considerably (p < .01), indi- 
cating interference. Performance was 
poorest, however, when differences in 
both moments were absent (p < .01). 

Table 2 shows the mean RTs for the 
eight experimental conditions. It is 
interesting to note that a difference in 
Da or Ka alone resulted in a consider- 
ably poorer performance than when 
differences in both moments were 
present simultaneously, while a differ- 
ence in Sa alone led to best perform- 
ance and the addition of either Da or 
Ka or both deteriorated it. 


Discussion 

The probable reason why the main 
effect of Ka did not turn out to be 
Statistically significant is discussed under 
Exp. IL. The interaction between Sa 
and Ka was highly significant, however. 
The pattern of interaction indicated 
interference between these two variables. 
Whether the observed interference was 
Mutual or unilateral is not clear, and 
further experiments will have to be con- 
ducted to establish this point as well as 
the exact reason for the interference. , 

It was noted that in each of the six 
experimental conditions showing hetero- 
Seneity among problems it was due to 
only one or at most two markedly 
deviant problems. It was established 
that of the various possible sources of 
J€terogeneity only two were actually 
involved, One source was the algebraic 
Sign of Sa, While the absolute values of Sa 
Ot a shape in 0 and 180° rotations are the 
same, they differ in algebraic sign. Thus 
' some problems with no difference In Sa 
the differing algebraic signs introduced a 
IScriminable clue in the form of discrimi- 
nable directionality of the two shapes. 

e characteristic of the second group 
deviant problems can be illustrated 
Y comparing, for instance; @ square we M 
'* same square from which a short, 
pointed spike protrudes. The | 
;JUares are clearly discriminable, yeti 
cause moments of area are mena 
mie nali protruding arcas, the pec m 

ments of the two squares hardly difter. 


of 
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The first source of heterogeneity of 
problems can be eliminated by simply 
rotating one of the two shapes 180°, 
Two possible ways of eliminating the 
spike" problem suggested themselves, 
namely the use of moments of the perim- 
eter instead of moments of area and/or 
the use of more than the first four 
moments to specify problems. These 
possibilities were explored in a second 
experiment. 

EXPERIMENT II 

Although moments of area describe 
the distribution of the area enclosed 
by the contour of a shape, this area is 
really “empty” in that it contains no 
information—all information in a 
simple, two-dimensional, black-and- 
white polygon is contained in its con- 
tour. This has been shown by Att- 
neave and Arnoult (1956) and is 
indicated in eye-movement studies 
(Brandt, 1945; Buswell, 1935). A 
study of eye movements in response 
to random and geometric polygons 
(Zusne & Michels, 1964) has shown 
that in such stimuli it is the outlines 
of the shape that are scanned, 
especially the intricate, interesting, or 
important portions of it, while the 
enclosed area is almost completely 
ignored. ]t seemed indicated, there- 
fore, to test the utility of moments of 
the perimeter, ie., moments of shape 
weighted entirely along its contour, 
leaving the enclosed area out of 
consideration. 

The above-mentioned "spike" phe- 
nomenon also suggested the possi- 
bility that heterogeneous problems 
had been constructed because not all 
of the relevant moments of shape had 
been specified. This brings up what 
is known in mathematics as the 
“problem of moments,” which is 
simply this: if all moments of a 
distribution are given, is this distribu- 
tion thereby uniquely specified? The 
answer to this problem has not yet 
been given, and it is well to note that 
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TABLE 3 


INTERCORRELATIONS AMONG MoMENTS OF AREA AND MOMENTS 
OF THE PERIMETER OF 100 RANDOM SHAPES 


U2ae | U2a | U3a | Usa | U2p. | U2p | U3p | Usp | USp | Usp | U7p 
U2a. = 53 ES .62 .88 .36 .08 .32 4 .30 21 
U2a — 56 91 31 .66 4 .62 .22 35 332 
U3a — 46 34 -76 .23 -79 33 .78 i 
U4a — 42 Ad ll 46 .20 AS 28 
U2pe — 39 07 34 14 Ky 23 
U2p = 23 94 34 .86 ES 
U3p — 27 94 .29 74 
U4p = 40 | .97 | .56 
USp — 44 .87 
Uóp = 61 
U7p — 


Note.—Small letters designate a moment of area or of the perimeter, number indicates the order of the moment, 


and c is the second moment constant. 


it has been raised for one-dimensional 
distributions only. In a concrete case 
the specification of “all moments" is 
effectively curtailed by the capacity 
of even the largest computers, which 
limits the investigator to the speci- 
fication of “more” rather than *'all" 
moments. 


Method. 


Subjects.— l'wenty-two male students en- 
rolled in an introductory psychology course at 
Purdue University served as Ss. 

Stimuli and apparatus.—The same pool of 
shapes was used as in Exp. I. In punching a 
shape only its contour was represented by 1 
punches, while area both outside the shape 
and inside it was coded by 0 punches. An 
amplified but essentially the same computer 
program was used to compute the first eight 
moments of the perimeter for the 100 shapes. 

The method of stimulus presentation and 
the apparatus used were the same as in 


Exp. I. 


Experimental design.—Table 3 shows that 
all even moments of the perimeter were highly 
correlated, as were all odd moments. This 
made the construction of problems using 
differences in the size of all moments both im- 
possible and unnecessary. As before, the 
second moment constant did not correlate too 
highly with any of the other moments. This 
measure plus the third and fourth moments of 
the perimeter computed about the perceptual 
axis of symmetry were used to define the ex- 
perimental conditions. In the following, these 
measures will be referred to as perimetric dis- 
persion or Dp, perimetric symmetry or Sp, 
and perimetric kurtosis or Kp, respectively. 

Forty discrimination problems were con- 
structed according to the plan used in Exp. I. 
The statistical design, instructions, and ex- 
perimental procedure were identical to those 
used in Exp. I. 


Results 


An analysis of variance was per- 
formed on log (1 + RT). A summary 
of the ANOVA appears in Table 1. 


TABLE 4 


Mean Reaction TIME (IN SEC.) TO DISCRIMINATION PROBLEMS 
IN EIGHT EXPERIMENTAL CONDITIONS or Exp. II 


Experimental Cond.* 


AoBiCo | AoBoCo AoBiCi 


AiBiCo 


A1BoCi AiBiCi AiBoCo AoBoC1 


i3 | 184 1.43 


1.40 


1.39 1.26 1.25 


1.22 


not 


Underlined conditions do 
a A = second moment constant, 


ence absent. 


differ significantly (conservative 
B = third moment, C = fourth moment, 


Newman-Keuls test). 
1 = difference present, 0 = differ- 


A Ree ee Cee) See D EMI P sc ern rmn mer P 
2 x xL 


In contrast to Exp. I, the main 
effects of all three moments were 
found to be statistically significant 
(p < .01), with no decrease in the 
size of the F ratio with increased order 
of the moment. Similar to Exp. 1, 
problems within experimental condi- 
tions contributed significantly to the 
total variation; the overall hetero- 
geneity of problems, however, was less 
pronounced than in Exp. I. 

Of the four interactions only 
one was not statistically significant. 
Analysis of simple main effects showed 
the following patterns. As long as a 
difference in Dp was present, the 
presence or absence of a difference in 
Sp did not have any appreciable effect. 
When no difference in Dp was present, 
the presence of a difference in Sp 
significantly increased RT (P < .01). 

The interaction of the two even 
moments, however, was quite differ 3 
ent. The RTs did not differ signifi- 
cantly among themselves when a 
difference in either Dp, or Kp, or both 
Was present, Performance did de- 
teriorate significantly (^ < .01) when 
a difference in both moments was 
absent, Table 4 shows the mean RTs 
for the eight experimental conditions. 
When this table is compared with 

able 2, a very definite, systematic 
relationship is seen: both the easiest 
and most difficult conditions in Exp. 
Were the most difficult in Exp. I, 
While conditions of medium difficulty 
Were the same in both experiments. 


Di E 
tScussion í 
ne degree O 


The i ift in tl 
. s c shift in 
ystematic sh mental con- 


ifficulty between the exper! D 
itions of the two experiments may ^ 
"Xplained, partially at least by En 
ference between the configurationa 
Properties represented by momen 
“rea and moments of the perimeter. h 
p bles of shapes with the highest and the 
rest values of moments of area Ke 2 
ar, Perimeter in the pool ot 100 83-P t 
re shown in Fig.2. While Sa is a direc 


Iv M VI 
Vil vill Ix 


Fic. 2. Examples of shapes high and low 
on the three measures of moments used in the 
two experiments. (These shapes had the 
highest and the lowest values of these mo- 
ments among the 100 shapes in the original 
sample. I—high Da and Ka; II—low Da and 
Dp; IlI—high Sa; IV—low Sa; V—low Ka 
and Kp; VI—high Dp; VII—high Sp; VIII— 
low Sp; IX—high Ep.) 


index of the symmetry of the two halves 
of a shape about its perceptual axis of 
symmetry, Sp expresses the balance be- 
tween the two halves in terms of the 
length of the perimeter in each half. 
When the perimeter is compressed in a 
small area, it produces perimetric asym- 
metry without necessarily producing 
areal asymmetry. This lack of relation- 
ship is reflected in the rather low correla- 
tion coefficient between these two meas- 
ures: (r-.23). In the experimental 
condition where a difference in only Sp 
was present, the asymmetrical distribu- 
tion of the perimeter in the “different” 
shape was not easily noticeable, which 
probably accounts for the fact that this 
condition turned out to be the most 
difficult one. 

The dispersion of the perimeter in 
terms of Dp, however, is closely related 
to the dispersion of the area of a shape 
(r= .88), and once again proved to be a 
valuable predictor of performance. The 
Kp measure, on the other hand, is not 
too highly related to Ka (r= .46). 
Shapes that are leptokurtic in terms of 
Kp tend to have one or more contour 
lines in a vertical position. Such a shape 


em 
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turned 90° is platikurtic. Thus a differ- 
ence in Kp may show itself as a difference 
in the directionality of the shape, which 
is more readily perceived than an equal 
difference between two shapes in terms 
of Ka. The elongation described by the 
latter, being a sort of average of all 
portions of the shape, apparently needs 
much larger differences to be perceived. 

The difference between the two types 
of the third and fourth moments is quite 
probably also related to the fact that the 
one significant interaction (between Sa 
and Ka) in Exp. I turned out to be the 
only nonsignificant interaction in Exp. II. 
The present experiments, however, do 
not provide sufficient grounds to hy- 
pothesize about the exact nature of the 
relationship between the third and the 
fourth moments. 

The "spike phenomenon” could not be 
completely eliminated by the use of 
moments of the perimeter, but its magni- 
tude was reduced. It appears, therefore, 
that in future experiments differential 
weighting of the perimeter would have 
to be resorted to by weighting percep- 
tually conspicuous portions of the perim- 
eter more than its other parts. 

The investigation reported in this 
paper was only an initial step in the 
evaluation of moments as form param- 
eters, and many questions remain to be 
answered and many problems solved. 
Nevertheless, some of the advantages of 
moments as compared with the form 
measures of this category used so far are 
apparent even now : (a) Instead of a large 
number of unrelated measures a small 
number of measures having the same 
mathematical basis is used. (b) The 
possible uniqueness of representation has 
already been mentioned. Such a repre- 
sentation, if a fact, means that moments 
exhaust all possible configurational as- 

pects of two-dimensional visual display s 
(c) Moments permit the quantification of 
form on continua rather than ina diserat 
or dichotomous fashion. (d) Fine i 
functions of moments through a complete 

i ermits fast, easy, and 
cycle of rotator R f sh on the 
quantified comparison’ OL siapa do by 
various characteristics represente hat 
the moments. (e) Finally, ppá en 
the advantage of possessing a reta ively 
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broad interdisciplinary basis: the concept 
of moments is a well-established one in 
mathematics, statistics, physics, and the 
several branches of engineering. 
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STIMULUS CORRELATES OF VISUAL PATTERN 
DISCRIMINATION BY HUMANS: 
AREA AND CONTOUR! 
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Wisconsin Regional Primate Research Center 


‘This experiment wa 
rhesus monkeys. ! 
patterns was virtually perfect 
contour dimensions 0 
monkeys on the same 


This research program is concerned 
with identifying the physical stimulus 
dimensions utilized by primates in 
discriminating visual patterns. The 
initial report (Polidora & Thompson, 
1964) established that the disparity 
between (a) the areas and (b) the con- 
tours of the two patterns which com- 
prised a simultaneous discrimination 
problem were both direct, monotonic 
determinants of discriminative per- 
formance of monkeys. Further, the 
contributions of the area and contour 
dimensions were shown to be in- 
dependent and additive. To deter- 
mine the extent to which these same 
dimensions relate also to human dis- 
Criminative performance, the present 
experiment was designed as an exact 
replication of the monkey study. 


METHOD 

and 16 female 
f Wisconsin 
Laboratory 
naive with 


Subjects. —Sixteen male 
employees of the University © 
timate Center and Primate 
volunteered as Ss. All were 

respect to the experiment and problem. 

b Abbaratus.—Except for the changes noted 
Clow, the apparatus described in the monkey 

report (Polidora & Thompson, 1964) and in 

an apparatus paper (Polidora & Main, 1963) 

Was used. Briefly, it consisted of a key punch 

Se 1 
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sa replication, with human Ss, of an earlier one with 
Juman proficiency of discriminating visual metric 
and less directly related to the area and 
f these patterns than was the performance of 
discrimination problems. 


which concurrently programed stimuli and 
recorded responses, a test panel which con- 
tained the displays and manipulanda, and a 
control console. For this experiment the test 
panel was removed from its usual position in 
front of the monkey's cage and placed on a 
table vertically in front of 5. The E adjusted 
S's seat so that the displays were at eye level 
18 in. away. Each of the two displays con- 
sisted of 12 $in. square lights (elements) 
mounted adjacently in a 3 X 4 array (see 
Polidora & Main, 1963, Fig. 5). Instrumental 
responses were made by S's pressing the 
bottom-hinged transparent panel overlaying 
each display. When an instrumental response 
was made, both displays extinguished im- 
mediately, a bell sounded once if the response 
was to the correct pattern, and the intertrial 
interval (ITI) ensued. 

Trial presentation was under S's control. 
By holding closed a spring-loaded lever switch 
(the analog of the monkey's mask response), S 
presented trials at a rate determined by the 
duration of the preset ITI and S's response 
times. If S's switch was closed, the displays 
were lighted at the expiration of the ITI and 
remained on until S responded; if the switch 
was released, the stimuli did not light, and 
responses to the panels were ineffective. 

Stimuli.—Each stimulus pattern consisted 
of a luminous polygon formed by the lighting 
of certain elements in a display. By using 
the same IBM program cards as in the 
monkey study, the patterns presented to the 
human Ss were identical to those presented 
to the monkeys. Briefly, only patterns com- 
posed of 5, 6, 7, or 8 elements were used. A 
pair of patterns was selected for each two- 
choice problem such that the difference be- 
tween the number of ‘‘on” elements in the two 
patterns was either 0, 1, 2, or 3. This differ- 
ence, termed Element Disparity (ED), con- 
stituted the area dimension. For example, a 
S.element pattern paired with a 7-element. 
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pattern (5 X 7) produced a problem at ED 2; 
6 X 7 problems were ED 1; 8 X 8 problems, 
ED 0; etc. All 10 combinations of the four 
number of elements (5 through 8) were used. 

The second dimension, Contour Disparity 
(CD), was an index of the difference between 
the number of straight line segments around 
the periphery of each of the two patterns. 
Since the number of sides on any of these 
patterns is invariably a multiple of 2, CD was 
expressed as the difference in sides divided by 
2, so that a minimal CD increment had a 
value of 1. 

It should be understood that each pair of 
patterns had both an ED and a CD value. 
The 500 problems presented to each S con- 
stituted 10 replications of the 50 ED-CD 
problem classes. 

Experimental design—The experimental 
design was two replications of a factorial 
combination of 4 levels of ED (0-3), 5 levels 
of CD (0-4), and 5 levels of ITI (2, 4, 6, 8, 
and 10 sec.). 

Procedure—The only procedural differ- 
ences between this and the monkey study 
were in the number of trials per problem and 
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problems per session. The human Ss had 
only two trials per problem, 100 problems per 
daily session for 5 consecutive weekdays. A 
5-min. break bisected every session. Also, 
each session was run at one ITI, and every .S 
received a different random assignment of the 
5 levels of ITI to the 5 sessions. At the 
beginning of the first session S received a 
description of the task, an explanation of the 
win-stay, lose-shift strategy of solving two- 
trial discrimintion learning set problems, and 
coached practice on three standard problems. 


RESULTS AND DISCUSSION 


Since there was a statistically sig- 
nificant difference between perform- 
ance on the first half and second half 
of each of the five sessions, F (1, 30) 
= 13.57, p < .001, the results are 
presented in Fig. 1 by half session. 
(Inspection of the data revealed that 
this practice or warm-up effect was 
most pronounced in the first session.) 
These results indicate that although 
the patterns were discriminated ex- 
tremely well in general, values of ED 


CORRELATES OF VISUAL PATTERN DISCRIMINATION 


and CD correlated moderately well 
with performance within the 2% 
(97%-99%) range of variation. How- 
ever, only the CD dimension was a 
statistically significant source of vari- 
ance, F (4,120) = 4.04; p< .005; 
ED, Sex, ITI, and all interaction 
sources were nonsignificant (p> .05). 

Each point of the CD functions in 
Fig. 1 is the mean value computed 
across all levels of ED, and the con- 
verse is true for the CD functions. 
To determine the "pure" contribu- 
tions of ED and CD to discrimina- 
bility, mean performance on problems 
with an identical number of elements 
in each pattern (i.e., no area cue) was 
computed for each level of CD (CD 
function at ED 0), and the analogous 
function was computed for ED (ED 
function at CD 0). These functions, 
presented in Fig. 2, support an inter- 
pretation that both ED and CD were 
utilized as sources of discriminability. 

It seems likely that a ceiling effect 
attenuated the performance differ- 
ences along the CD and ED dimen- 
sions, differences which might other- 
wise have been obtained with pattems 
of finer “grain” (such as those use y 
Webster, 1963, and others cited by 
him) or with a measure more sensitive 
than percentage correct responses. 
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Although we do not plan to study 
finer-grain patterns, future research 
will attempt to determine whether 
response time (latency) is as sensitive 
a measure of discriminability as has 
been suggested by Brown, Hitchcock, 
and Michels (1962). While these in- 
vestigators have also reported es- 
sentially perfect discrimination of 
visual patterns by human Ss, they 
found that response time in a six- 
choice discrimination task was related 
to the “complexity” (sidedness) of 
random shapes. 
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Ss were required to count the number of occurrences of each of several 
different symbols (categories) presented sequentially as a function of 
the number of categories (2, 3, or +), the total number of items (8, 12, 
16, or 20), and the rate of presentation (one item every 0.6, 1.0, 2.0, 3.0, 
and 4.0 sec.). In general, the greater the number of categories, the 
greater the total number of items, or the faster the rate of presentation, 
the poorer the performance. Of primary importance, however, is the 
complex interaction between these variables. The observed results are 
explained primarily on the basis that both presentation rate and number 
of categories affect the time available for rehearsal of the information 


while trial length results in an acc! 


In a series of experiments reported 
by Erlick (1961), Ss were asked to 
identify the more frequent of two 
mutually exclusive events occurring 
in a temporal sequence. The symbols 
representing these events (letters and 
numbers) were presented one at a 
time as an outline of light against a 
dark background. In one of these 
experiments both the rate of presenta- 
tion and the number of symbols in a 
sequence were varied. It was re- 
ported that at a presentation rate of 
one event every second, the shorter 
sequences resulted in better perform- 
ance than the longer sequences (as 
measured by the increment of the 
more frequent event required to 
maintain a 75% accurate identifica- 
tion of that event), while at the faster 

1The work was sponsored by Cornell 
Aeronautical Laboratory, Incorporated, in- 
ternal research funds. The authors gratefully 
acknowledge the assistance of Donald F. 
Dansereau and Charles Hall in gathering and 
reducing the data and the many valuable sug- 
gestions of William J. Ruby. Rr oe 
^ ? Also affiliated m the State University 
f Neu Vi ce 2 alo. 

a oe ee Cornell Aeronautical 


Laboratory, Incorporated. 


umulation of errors with time. 


rates of presentation of four and eight 
events per second, the reverse was 
true.  Erlick suggests that at the 
slower rate, individual events could 
probably be discriminated one at a 
time and mentally tallied so that ac- 
curacy was primarily a function of 
memory processes. At the faster 
rates, however, it was suggested that 
the events were presented too fast to 
be tallied separately, and therefore Ss 
reorganized the input sequences into 
larger units or “chunks” (after Miller, 
1956) which were the basic units for 
the mental tally. In this case, the 
advantage of the longer sequence was 
attributed to the increased time 
sample which in turn allowed for a 
more accurate estimate of the events. 
Thus, it appears that two different 
processes were brought into play, 
depending on the rate of presentation 
of the events. At the slower rate, 
performance may have been primarily 
a function of recall or memory proc- 
esses, while at the faster rates, com- 
plex perceptual processes were operat- 
ing in addition to the memory 
processes. 
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SEQUENTIAL EVENTS IN 


The present study was designed to 
examine human ability to mentally 
tally and recall events presented 
sequentially in a manner similar to 
that of Erlick at stimulus presentation 
rates which clearly allowed the in- 
dividual events to be discriminated. 
The S’s task in this experiment was 
to keep track of the number of times 
that each event occurred in a sequence 
rather than to simply indicate which 
event occurred most frequently. In 
Some respects, the demands on S 
resemble those imposed by Yntema 
(1963). In Yntema's experiments, S 
was read a series of messages, each of 
which informed him that one of a 
number of variables (events) was in 
Some particular state. Incoming in- 
formation displaced previous informa- 
tion so that .S had to essentially keep 
track of the present state of several 
variables and mentally erase past 
States, 


METHOD 


Subjects —The Ss were 150 students re- 
cruited from an introductory psychology 
course at the State University of New York at 
Buffalo. 

A pparatus.—The S w: 
a panel, the face of which w [ 
60° angle to his horizontal line of sight. An 
Industrial Electronic Engineers, Incorporated 
in-line readout display unit was mounted in 
the panel. The distance from the display to 
S's eye was approximately 2 ft. This display 
Unit, which closely resembled the one used by 
Erlick (1961), contained 12 lenses, each 
etched with one of the symbols to be presented 
to S. By turning on and off individual lights 
behind each of the lenses, the symbols were 
Projected one at a time onto approximately 
the same location of a 13 X 14 in. frosted 
glass surface, The symbols, which were 
letters of the alphabet, were seen asan end 
9f light against a dark background. Six suc 
display panels were located side by ie 
enabling as many as Six Ss to be ge 
Simultaneously. The Ss were separated by 
Partitions which effectively blocked their 
View of one another. 

. The sequences of letters ff 
display and the rate of presenta 


as seated in front of 
as positioned at a 


ashed on the 
tion were con- 
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trolled by Grason-Stadler Company be- 
havioral research equipment modules. This 


equipment was located in an adjoining room 
so that Ss could not hear the noise generated 
by the operation of the equipment. 

Procedure.— he Ss were assigned ran- 
domly to 15 equal groups, each serving under 
a different combination of rate of symbol 
presentation and number of categories. The 
rates were one letter every 0.6, 1.0, 2.0, 3.0, 
and 4.0 sec. with the on-time to off-time ratio 
being approximately equal in each case. The 
number of categories, defined as the number 
of different letters appearing on each trial, 
were 2, 3, and 4. When two categories were 
presented, the letters were Q and R; for 
three categories, Q, R, and S; and for four, 
Q, R, S, and T. 

Trial length, defined as the total number of 
letters presented on a trial irrespective of the 
category to which they belonged, was included 
asa within-Ss variable. All Ss were presented 
with trial lengths of 8, 12, 16, and 20 letters. 
To permit the assessment of practice, a total 
of 32 trials was presented. Each of the four 
trial lengths was presented twice in random 
order during the first 8 trials and twice more 
in different random orders in each of the 
remaining three blocks of 8 trials. There were 
four different sequences of letters for each of 
the four different trial lengths so that Ss were 
exposed to each of the different sequences 
only twice: once in the first 16 trials and once 
in the second 16 trials. "The sequences of 
letters were chosen at random with the 
restriction that each category appeared at 
least once on every trial. 

All Ss were instructed to mentally count 
the number of times that each of the different 
categories (letters) was pr ented in a trial 
and to record the answer at the end of the 
trial. ‘Two practice trials were given to be 
sure that each S understood the instructions. 
"These trials were followed by 32 experimental 
trials with a 15-sec. intertrial interval for 
recording answers. A loud clock signaled the 
beginning and end of each trial. In all cases, 
Ss had knowledge of the number of different 
categories to be presented and the specific 
letters that would appear, but no knowledge 
of the total number of letters to be presented 
per trial. Finally, Ss received no instructions 
concerning any method which might be 
utilized to mentally count the number of 
times each category appeared. 

Upon completion of the data collection, Ss 
were asked to describe in writing the tech- 
niques used to keep track of the number of 
occurrences of each category. 
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RESULTS 


Three indexes of performance were 
examined: (a) absolute error, defined 
as the sum of the absolute differences 
between the number of letters pre- 
sented in each category and the num- 
ber reported in each category; e.g., if 
5 Qs and 3 Rs were presented and S 
reported 6 Os and 2 Rs, the number 
of errors for that trial would be a total 
of 2, one for the Q category and one 
for the R category, (b) percentage 
error, defined as the ratio of the 
absolute error per trial to the trial 
length multiplied by 100, and (c) Ss’ 
written comments, 

The absolute error served as the 
primary measure of performance on 
each trial for each S. To assess the 
effects of practice, the two scores of 
equal trial length within each block 
were summed. This resulted in a 
single score for each of the four trial 
lengths within each of the four blocks. 
These scores were subjected to an 
analysis of variance with Rate of 
Presentation and Categories as be- 
tween-Ss variables and Trial Length 
and Blocks as within-Ss variables. 
The data contributing to the four 
significant main effects revealed that 
performance varied inversely with 
number of categories, F (2,135) — 52.9 
(p < .001), trial length, F (3, 405) 
— 200.9 (p « .001), and rate of pres- 
entation, F (4, 135) = 48.3 (p < .001) 
and varied directly with amount of 
practice, F (3,405) — 18.1 (p « .001). 

Of greater importance, however, are 
the various interactions between these 


i ing the 
variables. The data underlying t 
significant Trial Length X Catego- 
ries interaction, F (6, 405) = 14.3 


(p < .001), suggests that, as the 
length of the trial increased, perform- 
ance degraded in a linear fashion and 
that the rate of degradation increased 
as the number of categories increased. 
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Similarly, the data for the Trial 
Length X Rate interaction, F (12,405) 
= 18.4 (p < .001), indicates that, at 
the slowest rate of presentation, (one 
letter every 4 sec.), there were only 
slight differences in performance with 
the various trial lengths, but as the 
rate increased, these differences be- 
came gradually more pronounced. 
The data underlying the Trial Length 
X Rate X Categories interaction 
which yielded an F (24,405) = 2.0 
(p « .01) have been plotted in Fig. 1. 
It can be seen that when two catego- 
ries of information were presented, 
mean error increased as trial length 
increased at the faster stimulus pres- 
entation rates of one letter every 0.6 
sec. and 1 sec., respectively. How- 
ever, at the slower rates of presenta- 
tion there was little change in per- 
formance with an increase in trial 
length except for a slight increase in 
error with a trial length of 20 letters. 
In other words, when the rate of 
presentation was one letter every 2 
sec. or less, Ss kept track of events 
with almost perfect accuracy regard- 
less of the trial length. When three 
categories were presented, increases in 
mean error as a function of trial 
length for the 0.6- and 1-sec. rates 
were even more pronounced. ln addi- 
tion, the 2-sec. rate resulted in a 
similar increase in error although of 
less magnitude. Finally, with four 
categories of information, all rates of 
presentation resulted in an increase in 
error as a function of an increase in 
trial length. The mean error at a rate 
of one letter every 0.6 sec. resembled 
the mean error at the 1-sec. rate. 
Similarly, at rates of one letter every 
2 sec. and every 3 sec., performance 
was nearly identical, yet at the same 
time, decidedly better than perform- 
ance at the two faster rates. Finally, 
performance at the 4-sec. rate was 
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superior to performance at any of the 
above rates. 

The interactions of Blocks X Cate- 
gories, F (6, 405) = 2.2 (p < .05), 
Blocks X Trial Length, F (9, 1215) 
— 5.9 (p « .001), and Blocks X Trial 
Length X Categories, F (18, 1215) 
— 2.2 (p « .01), suggest that differ- 
ential practice effects were present. 
In general, the data indicate that the 
amount of improvement over blocks 
varied directly with length of trial and 
number of categories When two 
categories were presented, there was 
no evidence of learning. With three 
categories, learning occurred only for 
the trial lengths composed of 16 and 
20 letters. Finally, with four cate- 
gories, performance improved across 
the first three blocks for the 12-, 16-, 
and 20-letter trial lengths, and across 
the last two blocks for trial lengths of 
8 letters. Although learning was 
dependent upon both trial length and 
number of categories, the significant 
Main effects for Categories and Trial 
Length were in evidence even on 
Block 4. The fact that all the inter- 
actions with both Blocks and Rate as 
components failed to reach significance 
indicates that improvements in per- 
formance attributed to practice were 
Not related to the rate of presentation 
of the stimuli. 

a With respect to perce! 
le second performance 1nce? 
found that, a general, for a given d 
of presentation and a given num "s 
9! categories, performance across So 
lengths was essentially invariant. . 
© more specific, as trial lena mr 
creased from 8 to 20, there was on y 2 
% increase in mean percentage error 
With two categories, a 276 crease 
Vith three categories, and 5% in- 
Crease with four categories. In short 
ad percentage error been Lo dn ne 
Primary performance measure, the 
Mteractions with Trial Len 
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component would have failed to reach 
statistical significance. 

In their written comments, most of 
the Ss indicated that during early 
trials they had tried several different 
techniques of keeping track of the 
number of occurrences of each cate- 
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gory but eventually settled down to 
one they considered effective. With 
the exception of those tested at the 
rate of one event every 0.6 sec., 72% 
of the Ss indicated that they found 
it most useful to mentally picture a 
set of windows, one corresponding to 
each of the different letters presented. 
Individual running tallys were then 
kept in each window and only the 
tallys were rehearsed after each pres- 
entation of anew event. By contrast, 
another 16% of these Ss indicated 
that they rehearsed both the letter 
and the tally after each presentation 
without the aid of any form of mental 
picture. For example, the former 
group simply rehearsed 5, 4, 2, 3; 
while the latter would have rehearsed 
50s,4 Rs, 2 Ss, 3 Ts. The remaining 
12% reported using a variety of 
techniques including physical aids 
such as tallying on their fingers. At 
the presentation rate of one letter 
every 0.6 sec., the picture was some- 
what different. With two categories, 
7 of the 10 Ss indicated that they 
actually rehearsed only one of two 
categories each time and made an 
estimate of the number of occurrences 
of the other at the end of the trial. A 
similar report was made by 7 of the 10 
Ss presented with three categories. 
In this case again, only one category 
was actually tallied and rehearsed, 
while the other two were estimated. 
Finally, with four categories at a rate 
of one letter every 0.6 sec., there was 
almost no agreement between Ss 
regarding the technique employed. 
Further, there was some indication 
that many of the Ss were still search- 
ing about for an effective technique 
at the end of the 32 trials. 


DISCUSSION 


In Erlick's (1961) experiment on rate 
of presentation, 5s were asked to indicate 
which letter appeared most often rather 


than to tally the actual number of 
occurrences of each letter. He expressed 
performance in terms of the increment of 
the more frequent event over the less 
frequent event required to maintain a 
75% accurate identification of that 
event. Thus, differences in both Ss’ task 
and the performance measures employed 
preclude a direct comparison of perform- 
ance at those rates of presentation 
common to both studies. However, the 
small number of errors obtained with two 
categories in the present study, par- 
ticularly at the shorter trial lengths lends 
support to his hypothesis that, at slow 
rates of presentation (i.e., one event per 
second or less), individual events falling 
into two categories can be discriminated 
one at a time and mentally tallied. 
Erlick further suggests that at these slow 
rates if the events can be discriminated 
and tallied one at a time, then it is 
memory processes which play an im- 
portant role in the degree of accuracy. 
Verbal reports of Ss in the present study 
add credence to this hypothesis insofar 
as they indicate the extent to which 
rehearsal techniques were employed. 
It appears that, as long as the presenta- 
tion rate is slow enough to allow for 
rehearsal of all categories (assuming the 
number of categories is small and S's 
memory span is not exceeded), differences 
in performance are not likely to result as 
either the number of categories or the 
trial length increases. However, if the 
number of categories is increased to the 
point that a given rate of presentation 
precludes complete rehearsal of all items, 
then the number of errors made should be 
a function of the trial length. That is, if 
the last state of the items in each cate- 
gory has not been rehearsed at the time 
another symbol is perceived, an error is 
more likely to occur in remembering the 
states of the unrehearsed categories 
which in turn may result in an error in the 
new total or tally. Once an error is made, 
it is not likely to be corrected at a later 
time, particularly if the error itself is 
rehearsed. Errors then should be cumu- 
lative, and the longer the trial length, the 
greater the number of errors expected. 
It is therefore possible to account for the 
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observed interaction between Rate, Cate- 
gories, and Trial Length primarily on the 
basis of time available for rehearsal. 
However, it is not reasonable to gen- 
eralize this explanation to situations in- 
volving a greater number of categories, 
because other factors may be involved 
such as those associated with the classic 
digit span. Further research is necessary 
to determine the effect of increasing the 
number of categories beyond four. 

Although the present study utilized a 
less complex set of stimuli and required 
an incremental tally as compared to the 
substitution of new messages for old 
required in Yntema's (1963) study, there 
is a striking similarity in the general 
conclusion of both investigations. Both 
Studies indicate that for a given trial 
length it is better to present few cate- 
gories with many possible occurrences of 
each than to present a large number of 
Categories with fewer occurrences of each. 
The similarity of this finding suggests 
that rehearsal may also be critical in the 
task used by Yntema. 

Assuming that task difficulty was in- 
creased by increasing the trial length, the 
number of categories presented, and the 
rate of presentation, it is not surprising 
to find differential practice effects as a 
function of the first two parameters— 
generally speaking, the more difficult a 
task, the greater the improvement in per- 
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formance with practice. It is somewhat 
surprising, however, that there were no 
differential practice effects associated 
with rate of presentation. A possible 
explanation might be that an insufficient 
number of trials were run for practice to 
begin to have a differential effect on per- 
formance as a function of rate. As 
indicated above, questioning of Ss after 
testing revealed that the large majority 
cast about for a counting technique and 

tried several before they found what they 
considered to be a satisfactory one. It is 

possible that at the faster rates where the 

greatest gain might be expected, Ss spent 

more trials searching about for a tech- 

nique than they did at slower rates and 

thus delayed the benefits to be derived 

from practicing with a specific technique. 

This possibility should be the subject of 

further research. 
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EFFECT OF IRRELEV 


NT INFORMATION ON A COMPLEX 


AUDITORY-DISCRIMINATION TASK? 
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An attempt was made to demonstrate that the detrimental effects of 
irrelevant information on discrimination learning are due to implicit 
response competition generated by task conditions. In a complex 
auditory-discrimination task, groups receiving different amounts of 
irrelevant information (1, 2, or 3 dimensions) never relevant to their 
task made fewer errors than groups receiving different amounts of 
irrelevant information which sometimes required differential responding. 
In addition, the errors increased with the number of irrelevant dimen- 
sions. Variation of task difficulty by manipulating the discriminability 
of the relevant information resulted in an enhancement of the effects of 
the irrelevant information. Repeated practice sessions reduced the 
effects of irrelevant information which was sometimes relevant at a 
faster rate than for the irrelevant information which was never relevant 


to the task. 


In many situations humans are 
faced with discrimination tasks made 
difficult by the presence of extraneous 
or irrelevant factors. Studies of tasks 
in which signals must be detected or 
identified from among others of a 
similar nature show performance de- 
crements brought about by irrelevant 
factors, e.g., Pollack (1955); Poulton 
(1953); Peters (1954); Pollack and 
Klemmer (1954); Egan, Carterette, 
and Thwing (1954); Webster and 
Thompson (1954). Although these 
studies show that the presence of in- 
formation irrelevant to the task 
seriously degrades performance, little 
attempt was made to specify the locus 
of the interference. In. addition, 
certain seemingly contradictory evi- 
dence exists. Tor example, Archer 
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(1954) found that the presence and 
amount of irrelevant information had 
little, if any, effect on the proficiency 
of identifying relevant dimensions in 
a complex visual discrimination task. 
In experiments involving speech com- 
munication Cherry (1953), Webster 
and Solomon (1955), and Spieth and 
Webster (1955) report that under 
certain conditions irrelevant informa- 
tion interferes with the reception of 
relevant words while under other con- 
ditions it has less or no effect. This 
implies that there are degrees of 
interference by irrelevant information, 
the level perhaps being determined by 
aspects of the task. Archer (1962) 
indicated that the discriminability or 
obviousness of irrelevant information 
plays an important part in determin- 
ing the effects of irrelevant informa- 
tion. Broadbent (1958) makes a 
similar point regarding selective atten- 
tion. When irrelevant dimensions are 
obvious or attention-getting they 
elicit responses which interfere with 
performance in comparison to situa- 
tions where they are less obvious. 
When the relevant and irrelevant 
dimensions are equally obvious con- 
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cept identification is difficult (Archer, 
1962). The level of interference 
should also be a function of the simi- 
larity between relevant and irrelevant 
dimensions. Peters (1954), Speith 
and Webster (1955), and Broadbent 
(1958), among others, report that as 
the irrelevant information is made 
physically dissimilar to the relevant 
information, e.g, by filtering irrele- 
vant vocal messages, there is little 
performance decrement. Therefore, 
it seems reasonable to assume that 
implicit response competition pro- 
duces the effects of irrelevant informa- 
tion and that when conditions obtain 
which provide for increasing amounts 
of response competition, serious per- 
formance decrements in the form of 
errors and/or response speed should 
be observed. 

Hodge (1959) provided a direct test 
of this assumption. In his task, con- 
text within a trial determined rele- 
vancy, that is to say, dimensions 
irrelevant to a particular discrimina- 
tion trial were relevant On others 
providing conditions for high levels 
of response competition and poor 
Performance. Irrelevant dimensions 
Present on a trial which were relevant 
On other trials interfered with per- 
formance more than irrelevant dimen- 
sions which were never relevant to the 
task. The present experiment Was 
designed as an extension of Hodge's 
using auditory stimuli. Briefly stated, 
Its purpose is to demonstrate that in 
Making complex discriminations, when 
Conditions are provided for increasing 
amounts of response competition par 
;ormance is progressively degraded. 
Two conditions of relevancy Were use 
to produce different levels of respons? 
Competition, ie., conditions where 
irrelevant dimensions which wert 
Sometimes relevant to the task Wer? 
contrasted with conditions where the 


irrelevant dimensions were never rele- 
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vant to the task. Since more interfer- 
ence should be produced by greater 
amounts of irrelevant information, 
one, two, or three irrelevant dimen- 
sions were used. In addition, in order 
to determine whether the effects of 
kind and amount of irrelevancy are 
enhanced by task difficulty, the 
difficulty of discrimination was varied. 
Three practice sessions were given to 
observe any systematic changes in the 
effects of the other variables with 
increasing experience. 


METHOD 


Subjects —The Ss were 42 undergraduate 
students attending San Diego State College. 
They were paid for their participation. 

Apparatus.— The apparatus consisted of a 
tape playback unit, with earphones and a set 
of response switches for S, and the appropriate 
recording devices. The stimulus material was 
received from the tape recorder at Ss’ PDR-8 
earphones at about 80 db. Four response 
switches were mounted in a small (10 X 4x9. 
in.) box and could be moved in two directions. 
‘The switches were spring loaded and returned 
to neutral (central) position when released. 
The Ss held their hands on the response board 
at all times. When S moved a switch away 
from its central or rest position a correspond- 
ing indicator light lit on E's control panel. 
Stop clocks calibrated in .01 sec, measured 
response latencies. The clocks were activated 
at stimulus onset and stopped when S made 
his response. Three sets of earphones and 
response boxes were provided so that three 
Ss could be run at one time. 

Stimulus material.—The stimuli were 4-sec. 
samples of a combination of dimensions ob- 
tained using a white noise and pure tones. 
As many as six bivalued dimensions were 
encoded in a sample. Two dimensions were 
present in all samples while the occurrence 
of the older four depended upon the experi- 
mental conditions holding during a particular 
sequence. These will be referred to as pri- 
mary and secondary dimensions, respectively. 

The basic signal was a white noise which 
was modulated slowly, yielding a rhythmic 
sound waxing and waning about once every 
second. Some dimensions were obtained by 
varying this signal, others were obtained by 
the addition of other signals. The two pri- 
mary dimensions and their discriminable 
values were as follows: (a) Pitch. The white 
noise was passed through a 4-octave filter and 
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'TABLE 1 


EXAMPLES OF THE LABELING FOR THE RESPONSE BOARD 


FOR THE DIFFERE) 


l IRRELEVANCY GROUPS 


Sometimes Relevant Group 


Never Relevant Group 


High Pitch Low Pitch 


High Pitch Low Pitch 


Loud Soft Soft Loud Loud Soft Soft Loud 
Fast High Accent Sweep Fast Fast Fast Fast 
Beat Warble 2 Up Beat Beat Beat Beat 
+a +a +a +a +a ^ ^ m 
Slow Low Accent Sweep Slow Slow Slow Slow 
Beat Warble + Down Beat Beat Beat Beat 


^ * indicates switch position. 


two values were chosen and designated as high 
and low pitch. The center frequencies for the 
high and low pitches were set at 4,000 cps 
and 800 cps, respectively. (b) Loudness. 
Two levels were designated as loud and soft. 
Since loudness and pitch interact, a psycho- 
physical study was run with eight Ss in order 
to obtain samples that sounded equally loud. 
The four secondary dimensions and their 
values were: (a) Beat speed. The rate of 
modulation of the white noise was varied. 
Two different rates were called fast (6/sec) 
and slow (3/sec). (b) Accent. Certain of the 
beats were accented or emphasized. That is 
to say, their level was raised several decibels 
above the other beats. The judgment re- 
quired concerned which beat was accented. 
Either every second or every fourth beat was 
accented. (c) Tome sweep. During the sample 
a tone was swept up or down in frequency be- 
tween 200 and 2,000 cps. (d) Warble tone. 
Warble tones of two different frequencies, low 
or high (500 cps and 1,500 cps) were inserted 
into the sample. 

Stimulus samples were constructed from 
various combinations of the dimensions listed 
above and were recorded on j-in. magnetic 
tape. The two primary dimensions, pitch and 
loudness, were present In all samples. The 
use of the secondary dimensions was deter- 
mined by the experimental conditions. Sev- 
eral sequences of 128 samples were made 
corresponding to variations in the task re- 
quirements and in the number of irrelevant 

i s. 
S ALIO to the difference among a 
samples’ dimensions listed above, the high ai 
low pitches of the basic signal pere 
systematically in order to make task ¢ i T E 
a variable. A pitch continu. was bei y 
varying the basic signal in discrete $-octave 


steps between 800 cps and 4,000 cps (center 
frequencies) on a Bruel and Kjaer j-octave 
filter. In this way, a pitch continuum of eight 
distinct pitches was formed, i.e., 800; 1,000; 
1,250; 1,600; 2,000; 2,500; 3,200; 4,000 cps. 
The lower four }-octave bands were desig- 
nated as low and the four high bands were 
designated as high pitch. Stimulus samples 
from the center of this continuum make diffi- 
cult the identification of the sample as high or 
low in pitch. ‘The psychophysical study men- 
tioned above established levels for the differ- 
ent pitches so that they would sound equally 
loud. 

Task and experimental design.— The Ss 
were required to identify the stimulus samples 
in terms of the value of each of three dimen- 
sions present. They had to identify the value 
of pitch (high or low) and loudness (loud or 
soft) of the basic signal present in each 
sample. Then, Ss identified the value of the 
relevant, third dimension. The movement of 
a switch documented Ss’ judgments about the 
signal. Each switch represented a unique 
combination of pitch, loudness, and relevant 
secondary dimension. The other secondary 
dimensions were present in different numbers 
for different Ss allowing variation of the 
amount of irrelevancy. Task conditions were 
varied by having some Ss respond to all four 
of the secondary dimensions on different trials 
during a sequence while others responded to 
only a single secondary dimension during the 
entire sequence. 

Four major variables were manipulated. 
Task or kind of irrelevancy and amount of 
irrelevant information were between-Ss vari- 
ables and discrimination difficulty and 
practice were within-Ss variables, Thirty-six 
Ss were divided randomly into two major 
groups receiving the different kinds of irrele- 
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vant information, sometimes and never relevant. 
Table 1 displays examples of the response- 
board labeling used for the two major groups 
which should prove useful for understanding 
the differences between the tasks for the two 
groups. Stimulus sequences were prepared 
which required one group to respond to all 
four of the secondary dimensions an equal 
number of times during the sequence of trials. 
On a trial Ss identified pitch, loudness, and 
relevant secondary dimension with the other 
dimensions irrelevant on that trial. However, 
the other dimensions were sometimes relevant 
in that it was necessary to identify them on 
other trials. During other sequences the 
second major group always identified the 
values of the two primary dimensions and one 
of the four secondary dimensions. The re- 
maining dimensions were never relevant to 
their task and were never discriminated al- 
though they might be present in a sample. 
Due to the small number of Ss used, complete 
counterbalancing, using all four secondary 
dimensions as the relevant dimension for 
different Ss, was not undertaken. In two 
Sequences either beat speed or tone sweep 
Served as the fixed secondary dimension for 
the never relevant group. Half the Ss in the 
never relevant condition discriminated be- 
tween the two beat speeds and the other half 
between the tone sweep up or down. ‘The 
results revealed no systematic difference 
between the two dimensions. 

In summary, Ss in the sometimes relevant 
Condition identified the pitch and loudness of 
the basic signal and then identified the value 
9f the relevant secondary dimension by 
Moving the switch in the appropriate direc- 
tion, The procedure was the same for Ss in 
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the never relevant condition except that only 
one secondary dimension was associated with 
all pitch-loudness combinations of the basic 
signal. These groups provide a comparison 
of results obtained under conditions where 
specific response competition is likely and 
where it is not. j 

To aid further in understanding the con- 
struction of the stimulus samples and the 
discriminative responses required of Ss in the 
different conditions Table 2 is presented. 

Equal subgroups within each major group 
were formed (» = 6) and received different 
amounts of irrelevant information. In the 
simplest condition, one irrelevant dimension 
was added to the relevant stimulus dimen- 
sions during a sequence, and similarly, two 
and three dimensions were added to the 
sample for the other groups, In all sequences 
three dimensions were relevant in a sample 
with either one, two, or three irrelevant di- 
mensions present. 

During all sequences discrimination diffi- 
culty was varied by introducing the 8-step 
pitch continuum. ‘There were 16 samples of 
each }-octave filter setting during the 128 
trial sequences given to all Ss. All Ss received 
one practice session on each of 3 days. 

For comparison purposes, à control group 
= 6) was given an experimental sequence 
never relevant condition 
information was 


(n 
identical to the 
except that no irrelevant 
present. " A 

Procedure. —The Ss were all given instruc- 
tions and preliminary training to familiarize 
them with their discrimination task and the 
response to be made. When Æ was sure that 
Ss understood their task 20 preliminary 
“warm-up” trials were given during which 


TABLE 2 


5 Terram G 
Summary or THE DIMENSIONS Usep TO G 


AND THE DISCRIMINATIONS 


CONDITIONS OF 


REQUIRED UNDE 


THE STIMULI, THEIR VALU 
R THE DIFFERENT 
IRRELEVANCY 


ERA 


Irrelevancy Cond. 


Dimension Values Never Relevant | Sometimes Relevant 
a rare aim ea er 
Primary i identit ganon : identification : 
Pitch high-low (pitch varied) reed required 
Loudness loud-soft requiret | required 
Second: " 
p m TENES  required* " sometimes required 
Recent n never requires sometimes required 
Tone sweep up-down Ms zeguired sometimes required 
(Vasble touc high-low never required | sometimes required 
resp, alf of the Ss in these groups were rene ‘tg (deat GRE on cach trial, The other Ss 


nded to this dimension and discriminate 
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Vic. 1. Average latency as a function of 
position on the pitch continuum of the basic 
signal and kind of irrelevant information. 
(The central positions [4] represent the 
pitches most difficult and the ends [1] the 
easiest to discriminate. Control Ss received 
no irrelevant information.) 


knowledge of results was given. Then the 
appropriate sequence of 128 samples was 
given. During the experimental sequence Ss 
were not told which response was correct. 
After each 4-sec. sample, the tape recorder 
was stopped until the responses and times 
were recorded. Then the next sample was 
played. Whenever possible, three Ss in a 
group were run at once. On the next 2 days, 
after short instructions reviewing the proce- 
dure, the 20 warm-up trials were given and 
then the 128-trial sequence. 

The response measures used were errors 
and response latency. ‘The responses were 
recorded by E for scoring at a later time. 
Latency, the time between the onset of the 
sample and the response, was recorded at this 


time. 
RESULTS 


The basic data analyzed in the ex- 
periment were the 24 average latencies 
and error proportions made by each F 
at the eight levels of Difficulty during 
the 3 days of Practice. | These data 
were subjected to analysis of pec 
using Kind and Amount of Irrelevan 
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Information as between-Ss and Diffi- 
culty and Practice as within-Ss 
variables. 

The analysis of the latency data 
revealed significant effects due to 
Difficulty and Practice only, F (7,210) 
= 12.645; F (2,60) = 22.490, p < .001 
for both tests. The effects due to 
Kind and Amount of Irrelevant In- 
formation did not reach significance. 
The Kind of Irrelevant Information 
interacted significantly with Diffi- 
culty, F (7, 210) = 2.580, p < .025. 
Figure 1 shows the average latencies 
for the different irrelevancy conditions 
as a function of discrimination diffi- 
culty. Each datum point is the 
average of 48 samples for each pitch 
for each of 18 Ss. The data for the 
control group were not included in the 
analysis and are shown for comparison 
purposes. Only one other interaction 
effect contributed enough variance to 
allow rejection of the null hypoth- 
esis. A significant Difficulty X Kind 
x Amount of Irrelevant Information 
triple interaction was found, F (7, 210) 
= 2.038, p < .05. 

The analysis of the errors revealed 
clear-cut effects of all major variables. 
Figure 2 shows the average proportion 
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Fic. 2. Average proportion of errors as a 
function of the amount of irrelevant in- 
formation. (Each datum point represents the 
average for a different group of Ss.) 


of errors as a function of the number 
of irrelevant dimensions for the two 
irrelevancy conditions. Significantly 
more errors were produced by irrele- 
vant information which was some- 
times relevant in comparison with 
that which was never relevant, 
F (1, 30) = 30.469, p « .001. In ad- 
dition, increasing the number of 
irrelevant dimensions produces sig- 
nificantly more errors, F (2, 30) 
= 6.682, p < .01. The average pro- 
portion of errors made by the com- 
parison control group is shown in the 
figure also. The results of Dunnett's 
test (Edwards, 1962) which compares 
all six groups with the control, reveals 
that all three groups receiving the 
sometimes relevant, irrelevant in- 
formation and the group receiving 
three irrelevant dimensions never 
relevant to their task, differ signifi- 
cantly (p < .05) from the control Ss. 
In the same analysis, using the aver- 
age latencies, Dunnett's test revealed 
significant differences between any of 
the experimental groups and the 
control group. As the task of identi- 
fying the pitch of the sound was made 
more difficult, significant increases in 
errors resulted, F (7,210) = 43.667, 
b < .001. On the other hand, Prac- 
tice rapidly reduced the number of 
errors made, F (2 60) — 58.147, 
b « .001. 

'The differences between the error 
Proportions for the two kinds of 
irrelevancy were not the same for 
all levels of Difficulty and Practice. 
"he Kind of Irrelevancy and Difh- 
culty interaction reached significance, 
F (7, 210) = 2.093, Ż < 05. Figure 
3 shows the error data as @ function 
of discrimination difficulty. Once 
again the data for the control group 


are shown. , 
Practice also interacts significantly 


With the kind of irrelevancy» P Q. 60) 
= 4151, p « .025. The magnitude 
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Fic, 3. Average proportion of errors as a 
function of position on the pitch continuum 
(difficulty) and the kind of irrelevant in- 
formation. 


of the difference between the some- 
times and never relevancy conditions 
decreases somewhat with practice. 
The only other interaction reaching 
significance was that between Diff- 
culty, Practice, and Amount of Irrele- 
vant Information, F (28,420) — 1.518, 
p < .05. However, examination of 
average curves for these data reveal 
no systematic variation of the Diffi- 
culty and Amount of Irrelevancy 
interaction from day to day. 


Discussion 


The error data generally present a 
consistent picture of the detrimental 
effects of irrelevant information on com- 
plex discrimination learning. When in- 
formation is irrelevant to a present 
discrimination but is relevant at other 
times it significantly degrades perform- 
ance. This suggests that the locus of 
interference attributable to irrelevant in- 
formation is in the response competition 
generated by implicit responses elicited 
by the nonrelevant dimensions. The fact 
that more interference results with larger 
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numbers of irrelevant dimensions pro- 
vides further support for the hypothesis. 

On the other hand, the latency data 
reveal no effect due to the irrelevancy 
conditions. This result was not unex- 
pected, and probably was due to task 
conditions. The auditory-discrimination 
task required Ss to judge dimensions 
many of which were time dependent. 
This would tend to attenuate any differ- 
ences between groups. Support for this 
explanation is revealed also in the lack of 
any significant differences between the 
average latencies for the control group 
and the various experimental groups 
using Dunnett's test. 

In the analysis of both errors and 
latencies significant Kind X Difficulty 
interactions were revealed. Examination 
of Fig. 1 and 3 reveals somewhat longer 
latencies and greater proportion of errors 
although the curves are somewhat flatter 
for the sometimes relevant condition in 
comparison with the other conditions. 
'These results may be due to a greater 
amount of response competition gener- 
ated both by the sometimes relevant 
information and the difficulty of making 
the initial discrimination. 

The significant Practice X Kind of Ir- 
relevancy interaction indicates that the 
difference between groups receiving the 
two kinds of irrelevancy decreases with 
practice. Unfortunately, three practice 
sessions were insufficient to determine 
whether this convergence continues. 
This possibility, however, raises an in- 
teresting question about the manner in 
which the effects of irrelevant informa- 
tion are reduced by practice. The result 
in Hodge's (1959) study led him to a 
similar concern about whether Ss learn 
to ignore the irrelevant dimensions or 
merely learn responses to the relevant 
stimuli better. Ignoring stimuli which 
sometimes require | ip arp Eas 
sponses would require a mih y ^ ive 
filtering of the informationa inpu ^s hich 
could be acquired only by responding in 

sence of the competing informa- 
the i^ learning relevant responses 
ediates this convergence, practice in the 
absence of irrelevant 1n ormatio 
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be effective. The resolution of this prob- 
lem awaits future research. 
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A TEST OF THE "LIMITED 


CAPACITY" HYPOTHESIS ! 
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If as suggested recently an individual does have a limited capacity for 


recall in STM, then simultaneous va 


riation in number of presentations 


and rate of presentation (with total presentation time constant) should 


result in equal recall. 
tion of 6 pairs at a rate of 4 sec/pair, 
presentations at 1 sec/pair, then test 
of the 6 A-B pairs. 
were indistinguishable, while the 2nd 


shorter retention intervals. 
was concluded that the limited capac! 


approximation. 


Recently it was suggested (Mur- 
dock, 1964) that, in STM, individuals 
may have a limited capacity for im- 
mediate recall. Further, it was sug- 
gested that the limitation may be on 
the rate of processing information. If 
so, then one should be able to trade off 
number of presentations and presenta- 
tion rate if total presentation time is 
held constant. For instance, if each 
pair in a list of paired associates Is 
presented for a total of 4 sec., the list 
can be presented once at a presenta- 
tion rate of 4 sec/pair, twice at a rate 
of 2 sec/pair, or four times at a rate 
of 1 sec/pair (symbolized 1-4, 2-2, and 
4-1, respectively). The hypothesis 
would predict equal recall under these 
three conditions; the present experi- 
ment is a test of this hypothesis: 


METHOD 
To facilitate comparison with other studies 
the procedure was essentially that used oer 
ously (ec, Murdock, 19632, 63b). Al 
*B pairs were compose of common Englis h 
Words paired at random, and each list con- 
ix pairs. After 1-4, 2-2, or 4-1 nS 
pairs was tested by presenting 2 as 
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To test this hypothesis, Ss were given 1 presenta- 


2 presentations at 2 sec/pair, or 4 
ed for recall of the B member of 1 


Recall probabilities for the 1st and 3rd condition 


condition was superior only at the 


Considering this and other evidence, it 


ity hypothesis may be a useful Ist 


the cue for recall of B. Each of the six serial 
positions was tested equally often under the 
three experimental conditions. Thus, one 
replication required 18 lists. In all there were 
13 replications; group testing was used, and 
all Ss (N20) were tested on all 234 lists. 
The four test sessions required were held at 
the same time (11 :00 A.M.) on four consecu- 
tive Tuesdays. 

The same filmstrips that had been used 
previously (Exp. III, Murdock, 1963b) were 
used here. For the two conditions that re- 
quired repetition (ie., 2-2 and 4-1), after 
the last pair in the list had been shown for the 
requisite time, the stripfilm was rolled back 
to the first pair in the list. The roll-back took 
no appreciable time and was just a rapid blur 
on the screen. This procedure was repeated 
twice more for the 4-1 condition. Thus, 
with repetition the same serial order of the 
pairs was preserved, and for all practical 
purposes there was no intertrial interval. 
Actually, this lack of an intertrial interval is 
one of the two procedural differences between 
the 2-2 condition of the present experiment 
and the two-repetition condition of a previous 
study (Exp. III, Murdock, 1963a). The 
other difference is that, in the present experi- 
ment, Ss were given information before each 
trial as to whether the presentation rate would 
be “fast,” "slow," or “average.” 

Naturally, the critical pair was in no way 
distinguished prior to the recall test. The 
recall period was 15 sec., and a 5-sec. ready 
period preceded each list, The order of 
presentation of the 18 conditions within each 
replication was randomized. With group 
testing the specific A-B pairs could not be 
counterbalanced across the 18 conditions; 
however, with a total of 1,404 different word 
pairs randomly distributed, it seemed un- 
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of number of subsequent pairs. 


likely that any systematic differences in item 
difficulty could occur to bias the results, The 
Ss were students of both sexes from the 
introductory psychology course who were 
fulfilling a course requirement. 


RESULTS 


The best overall measure of perform- 
ance is the area under the serial-posi- 
tion curve (i.e., the sum of the p values 
for the six different serial positions). 
For the 2-2 condition the value was 
3.12; the comparable value for the 
two-repetition condition of the previ- 
ous experiment (Murdock, 1963a) 
was 3.47. The elimination of the 
2-sec. intertrial interval reduced the 
total presentation time by 8%; the 
fact that the recall performance was 
reduced by 10% does not seem dis- 
proportionate. 

The main results of the present 
experiment are shown in Fig. 1, where 
probability of recall is plotted as a 
function of number of subsequent 
pairs for each of the three conditions 
separately. It is apparent that the 
results for 1-4 and 4-1 are almost 1n- 
distinguishable ; however, the p values 
for 2-2 are somewhat higher at the 
shorter retention intervals (n par- 
ticular, one and two subsequent pairs). 

The above conclusions are sub- 
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stantiated by analyses of variance of 
number of correct recalls. Number of 
subsequent pairs, experimental condi- 
tions, and their interaction were all 
significant at well beyond the .001 
level. Duncan’s multiple-range test 
showed that, at the .01 level, 2-2 was 
better than either 1-4 or 4-1 but the 
latter two did not differ significantly. 
Separate analyses of variance at each 
of the six retention intervals showed 
that the experimental conditions were 
significant only for one and two subse- 
quent pairs; at both these retention 
intervals p < .001. Finally, it has 
been suggested (Murdock, 1963a) 
that, after two subsequent pairs, the 
retention curve is essentially asymp- 
totic. Therefore, the three longest 
retention intervals (i.e., three, four, 
and five subsequent pairs) were an- 
alyzed separately ; an analysis of vari- 
ance showed that the effects of neither 
subsequent pairs nor experimental 
conditions were statistically significant 
(in both cases, p > .05). 

As stated above, the area under the 
serial-position curve was 3.12 for the 
2-2 condition. The comparable values 
for 1-4 and for 4-1 were 2.62 and 
2.70, respectively. Thus, in terms of 
overall performance, the latter two 
conditions were about 15% poorer 
than the 2-2 condition. However, as 
indicated by the statistical analysis, 
most of the difference appeared to be 


TABLE 1 


PROPORTION OF OMISSIONS, EXTRALIST 
INTRUSIONS, AND INTRALIST A 
AND B INTRUSIONS FOR EACH 

EXPERIMENTAL CONDITION 


14 2-2 4-1 
Omissions .558 | .487 | .407 
Extralist intrusions 3200. | 179 | A27 
Intralist A intrusions | .079 | .085 | .113 
Intralist B intrusions | .163 | .250 | .353 


THE LIMITED CAPACITY HYPOTHESIS 


concentrated at one and two subse- 
quent pairs. 

There was a total of 2,496 non- 
correct responses, and an analysis of 
errors is shown in Table 1. This table 
shows the proportion of omissions 
(failure to respond), extralist intru- 
sions (any word not in the list), and 
intralist intrusions (amy incorrect 
word from the list) broken down into 
A and B items. As number of pres- 
entations increased from one to four, 
both omissions and extralist intrusions 
decreased but intralist intrusions (both 
A and B) increased. Perhaps the most 
important point to note about the 
error analysis is that, for all four 
categories, the trend was monotonic; 
in no case was there à maximum or à 
minimum for the 2-2 condition. 

To test for practice effects the first 
three replications were compared with 
the second three replications. An 
analysis of variance of the number of 
correct recalls showed that neither 
replications (ie, 1-3 VS- 4-6) nor 
the interaction of replications by con- 
ditions (ie. 1-4, 2-2 4-1) had a 
statistically significant effect on re- 
call (in both cases P > .05). An in- 
spection of the data gave no indica- 


tion that learning to learn and pro- 
active inhibition were counteracting 
t Serial 


each other. Performance à 
Position 6 (probably the best measure 
of degree of original learning) did not 
appear to improve ove these first six 
replications; neither did performance 
at Serial Positions 1-3 deteriorate over 
this period. However: such changes 
may have occurred within the first 
replication itself (see Murdock, 1964). 
Finally, as the number of pe 
tions increased from one to, four, the 
Opportunity for serial learning of te 
six pairs increased. The same seri 
Order was used with repetition-) 1 One 
measure of such serial learning is the 


e > H 
'adjacent/penultimate 
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In previous studies we have found 
that about two-thirds of the intralist 
intrusions are either from adjacent 
serial positions (i.e., immediately be- 
fore or immediately after the critical 
pair) or from the penultimate pair in 
the list (i.e., Serial Position 5). With 
serial learning of the pairs the former 
could be expected to increase relative 
to the latter (see Slamecka, 1964) and, 

in one study (Exp. Ill, Murdock, 

1963a), this ratio was 1.08, 1.38, and 

1.61 for one, two, and three presenta- 

tions, respectively. In the present 

study the A/P ratio was 1.48, 1.41, 

and 1.63 for one, two, and four presen- 

tations, respectively. Thus, it would 

seem as if relatively little serial learn- 

ing occurred ; presumably the faster 

presentation rate worked against the 

factor of repetition. 


DISCUSSION 


In general the "Jimited capacity" hy- 
pothesis would seem to be confirmed. 
For the longer retention intervals (i.e., 
3-5 subsequent pairs) the three condi- 
tions did not differ significantly, and 
at the shorter retention intervals (i.e. 
1-2 subsequent pairs) 1-4 and 4-1 
did not differ significantly. Also, the 
trend of omissions, extralist intrusions, 
and intralist A and B intrusions was 
monotonic}; thus, this measure of reten- 
tion failed to show any evidence of curvi- 
linearity across conditions. The one 
disquieting feature was the superiority 
of 2-2 at the shorter retention inter- 
vals. However, there is some evidence 
already that there are two components 
to the STM paired-associate retention 
curve (Murdock, 1963a), yet even here 
the differences that did exist were only 
of the order of 15%. 

Recent studies by Bugelski (1962) and 
by Bugelski and Rickwood (1963) are 
particularly relevant to the present find- 
ings. In the former study a list of paired 
associates was learned to criterion at 
one of five different presentation rates. 
While the number of trials to criterion 
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increased with presentation rate, the 
total exposure time for the five groups 
was essentially the same. In the latter 
study the trials were self-paced, yet the 
total learning time was clearly within 
the range established in the first study. 
Obviously, these results are just what 
would be expected by a hypothesis of 
limited capacity. Additional support 
comes from Nodine (1963) who reported 
that, “. . . an eightfold increase in ex- 
posure durations . . . produced slightly 
less than an eightfold increase in total 
correct responses [p. 1057." 

There is some evidence, then, that 
with paired associates a constant amount 
of material can be learned or retained 
given a constant amount of time. The 
same time constancy seems to operate in 
free recall (Cohen, 1963; Murdock, 1960; 
Waugh, 1962, 1963). If the limitation is 
on the rate of processing information, 
then these studies in verbal learning 
would agree well with studies of choice 
reaction time (Leonard, 1961). All 
things considered, it would seem that the 
limited capacity hypothesis may be a 
useful first approximation. 
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SIMILARITY IN STIMULUS MATERIAL AND STIMULUS 
TASK ON THE FORMATION OF A NEW SCALE 
OF JUDGMENT 


M. E. TRESSELT 


New York University 


180 Ss were tested individually, 36 Ss in each of 5 groups, in order to 
examine the effects of 2 different prior set task situations and 2 different 


prior set anchorin 
of judgment. 


test series of 12 weighted cardboard cont 
ach weight as heavy, medium, or light. 2 of the experimental 


to judge e 
groups were asked 1s 
gm.) ranking them in 
tic weighted conta 
series of weights. 

alphabetize books or a 
irrelevant task before being given 

tainers. The results confirm earlie! 
shows a gradual adjustment of the s 


scale of weights 0 
those Ss having hi 
greater displacement of the new 
Further, the task situation, 1.¢., 
produce any significant displacement 


t to make judgm 


Iphabetize the 
t 


Whenever an S is faced with a 
series of stimuli and required to make 
judgments along a given continuum, 
some anchoring stimulus or stimuli 
usually are used as referents. The 
anchor value may be one that 1s 
Present, i.e., a 12-in. ruler for length; 
or an anchor that is not present but 
Which is in the past experience of S, 
*-g., a memory of how long the longest 
line he has seen may have been, or 
Perhaps, the standard referent can be 
the form of an instruction such as 
the instruction to consider a yard as a 
long line, a foot as a short lie. —— 

When S is faced with a situation in 
Which he is aked to make judgments 
of a series of weights, for example, to 
Scale them as heavy; medium, and 
light, he would call from his memory 
as representative anchoring weights 
referents of similar properties to the 
Stimuli with which he is faced, and 
make comparisons. Rogers 


(1941) 
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ver a period of exposure. 


andled the plastic weig! 
scale than the groups exposed to books. 


weighing or alphabetizing, failed to 


g stimulus materials on the formation of a new scale 
The control group was given a prearranged order of a 


ainers (11-560 gm.) and asked 


ents of a series of books (612-1,184 


heaviness or to make judgments on a series of plas- 
iners (612-1,184 gm.) before being given the test 
2 other experimental groups were asked 1st to 


plastic weighted containers as an 
he prearranged 12 weighted con- 


r research that the control group 


le of judgment to the objective 
Also the results show that 
hted containers 1st showed 


ca. 


using the method of single stimuli, 
called for judgments in six absolute 
categories with and without an ex- 
perimental anchoring stimulus being 
present, and so demonstrated assimi- 
lation to various degrees of displace- 
ment of the anchoring stimulus from 
the critical series of weights. Tresselt 
(1947) conducted an experiment in 
which practice was given on a range 
of weights before Ss were given an 
expanded range and found that the 
judgments came to agreement with 
the new expanded scale. A weight, 
then, that was perceived as “light” in 
the practice scale and now was a 
middle weight of an expanded scale, 
became “medium” as S experienced 
the new stimulus range. 

In order to demonstrate the effect 
of similar and nonsimilar materials as 
referents, Brown (1953) used weights 
and metal trays as anchor materials. 
He found the greater effect on a 
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subjective scale of weights was pro- 
duced by the anchor weights (similar 
materials) as compared to the tray 
anchor weights (dissimilar materials). 
He also included anchor weights where 
no judgment of them was required and 
found these more effective on the 
critical series than tray anchors not 
judged. The anchor weights not 
judged were less effective in in- 
fluencing judgments than the judged 
anchor-weight condition, and the non- 
judged tray anchors were the least 
effective of all his conditions. 

The effect of nonrelevant aspects in 
the judgmental situation has been 
studied most recently by Rambo 
(1962) who presented his Ss with 
black dots on white cards where both 
dot size and card size varied in 
different relationships, i.e., random, 
positively related, negatively related, 
etc. He found that incidental ex- 
posure of stimuli acted as anchors for 
the intentional judgments made later 
to arestricted distribution of the same 
stimuli. 

In these latter two experiments, S 
is presented with material which he 
is not to judge or about which he is 
not required to make a verbal report 
in the anchoring conditions. The 
question may be raised that Ss may 
well have been suspicious of the 
materials and although nonverbalized, 
they may have utilized the scale 
referents because of the artificiality 
of a laboratory situation. Johnson 
(1944) points out that in his experi- 
ments, Ss would often move a chair 
or lift a book during their rest pauses 
without affecting their scale of values 
which were based on lifting stimulus 
weights. 

There are two problems, then, to be 
investigated here. The first problem 
relates to the similarity of materials 
constituting both the anchoring, and 
test series of stimuli. It is predicted 
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that the scale of judgment of a group 
of Ss given anchoring materials similar 
to the test series, i.e., weighted con- 
tainers, will show a greater displace- 
ment in the formation of their new 
scale than a group of Ss given anchor- 
ing stimuli of dissimilar materials, 
ie. books. The second problem 
examines the part played by degree of 
similarity in task in the formation of 
a new scale of judgment. Tresselt 
(1948) using professional weight lifters 
and watchmakers as Ss and using a 
series of 12 weighted containers as 
stimulus materials found that the 
center of the scale of the weight lifters 
was significantly different from the 
scale of the watchmakers. The hy- 
pothesis was suggested that the more 
similar the previous task to the task 
at hand, the more apparent would be 
the effect of the old scale on the 
adjustment to a new scale. It is 
predicted, therefore, that Ss required 
to make judgments of a similar nature 
to both the anchoring stimuli and the 
test stimuli, i.e., weigh them, will be 
more affected by the anchoring stimuli 
in forming a new scale than a group 
of Ss required to do a dissimilar task, 
ic., alphabetize the anchoring series 
and to weight the test series. All of 
the experimental groups should show 
more of an effect on the center of the 
scale than a control group having a 
heterogeniety of previous experiences 
prior to the test series of stimuli. 


METHOD 


Apparatus and Procedure 


There were two kinds of anchoring 
stimuli: books and weights. The five books 
alphabetically by author and by weight were 
in an increasing order, i.e., one by Allport 
weighed 612 gm.; Bills, weighed 841 gm.; 
Crane, 927 gm.; Guilford, 1,138 gm.; and 
Patterson, 1,184 gm. The other set of anchor- 
ing stimuli were five weighted black plastic 
containers (larger in size than the test stimuli) 
labeled All (612 gm.); Bil (841 gm.); Cra 
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(927 gm.); Guil (1,138 gm.); and Pat (1,184 
gm.) 

The test stimuli were 12 weighted cylin- 
drical cardboard containers whose weight in 
grams were 11, 60, 110, 160, 210, 260, 310, 
360, 410, 460, 510, and 560. 

There were five groups of Ss, 36 Ss in each 
group, a total of 180 Ss tested individually. 
Group A (the control group) was given only 
the 12 test weights in a prearranged order and 
asked to judge the weights as heavy, medium, 
or light. "The order of the test weights for the 
experiment as a whole was such that each 
weight was presented in a different serial 
Position to each S. Thus the 12 weights were 
lifted with equal frequency in each of the 12 
serial positions, In this way, for example, the 
ll-gm. weight was in the first position for 
three Ss in any group; in the second position 
for three more, the third for each of three 
more, and so on through the twelfth position. 
All Ss were blindfolded during the test series. 

The Ss in Group B were told on entering 
the office for testing that by accident some 
books were left right in the middle of the 
testing table either because E was trying to 
Straighten the office or because someone had 
brought the books back during the time Æ 
Was absent, The S was asked to give the 
books to E in alphabetical order one at a time 
So that they could be placed into the book- 
Case. The Ss were then told to sit down to 
do the experiment which was administered in 
the same way as for Group A. 

Group C was informed upon entering the 
Office that someone previously had dropped 
One of the plastic containers of weights (a 
broken one was on the desk) and since she 
Was not going to use this particular series any 
More Ss were asked to give her the weights in 
alphabetical order (as labeled) so that she 
ould put them away until E could get a new 
Weight to replace the broken one. The S was 
Siven the extra caution not to break or bc 
“Top any of them; thus to hand them to 7 
ne ata time, The S was then given the test 
Stimuli, 

b Group D was told that there “seemed n 
ab à variable present in the few subjects 
V ed tested because son 
. 0e weight- itive an LA 
ince T ecd for this ability," E 


ad just taken five books ar! 
shelf. ‘The Ss were asked to weigh the books, 


Presented h the desk, am 
a aphazardly on E 
"range them in order from the lightest to the 


Caviest. The Ss were then given the test 
Group E was 


put using the 


pitrarily from her 


Stimul: 1 
SOAM as the "experiment." 
v T s 

en the same instructions 
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plastic containers as the so-called weight- 
sensitive pretest followed by the "experiment." 

Since the actual weight in grams and 
alphabetical order were identical for all an- 
choring materials, each S had the anchoring 
experiences in the identical serial order. All 
Ss in the experimental groups were asked 
whether they had associated the anchoring 
task and the "experiment" together after the 
session was completed. Only three verbalized 
any insight into the problem and their data 
were eliminated and new Ss added in their 
places. 


RESULTS AND DISCUSSION 


The mean weight judged ‘“med- 
ium," the frequencies of these judg- 
ments, and their standard deviations 
are presented in Table 1. 

Because of the nature of the experi- 
mental plan, the Friedman two-way 
analysis of variance by ranks was 
used. A x? = 16.78 was obtained 
which is significant at the .01 level. 
'The most effective anchoring seems 
to be reflected in judgments by those 
individuals who alphabetized the 
weights (Group C ), next by those who 
weighed the weights (Group B); the 
group who alphabetized the books 
(Group B); those who weighed the 
books (Group D), and finally the con- 
trol group (Group A) having no 
anchor had the lowest general mean 
value for the weights perceived as 
medium over the 12 serial positions. 

In order to ascertain the relative 
strengths of the anchoring materials 
on the formation of the new scale, the 
data of Groups C (alphabetizing 
weights) and E (weighing weights) 
were combined and compared with the 
results of Groups B (alphabetizing 
books) and D (weighing books), the 
prediction having been made that 
lifting plastic containers whether al- 
phabetized or weighed will cause a 
greater change in the mean weight 
judged medium as compared to the 
judgments of those Ss who handled 
books, dissimilar in shape to the test 
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TABLE 1 


MEAN WEIGHT IN GRAMS JUDGED MEDIUM (M), FREQUENCY OF MEDIUM JupGMENTs (V), 
AND SD As A FUNCTION OF SERIAL POSITION ror Groups A, B, C, D, ANDE 


Group A Group B | Group C Group D Group E 
Serial | 
Position 
M \|N| sD | M |N|SD| M |N| SD| M |n| sp | M |n| sp 
1 305.5 |11 |152.9 |340.0 |15 |153.6 |421.1 | 9 |107.4 |306.2 |13 147.4 |406.7 |15 | 117.5 
iA 247.5 | 8 | 98.7 |320.0 | 5 |106.8 |322.5 12 |113.8 |272.5 | 8 92.7 |355.5 |11 | 138.9 
3 302.9 | 7 |132.0 |338.6 | 7 |127.8 |331.4 | 7 | 95.8 250.0 | 5 86.0 |298.9 | 9 | 104.8 
4 335.0 | 8 | 96.8 297.5 |12 137.2 |278.8 | 8 | 60.9 |260.0 5 | 54.8 |295.7 | 7 | 109.3 
5 232.7 11 | 80.8 235.0 |10 |134.6 |235.0 |10 |128.9 193.3 |12 | 72.9 1289.2 |12 | 107.0 
6 185.0 | 9 | 43.3 232.7 |11 | 93.8 [318.3 | 6 123.9 210.0 | 4 | 50.0 |240.0 |10 | 100.5 
7 328.2 11 [133.6 |230.0 |10 |104.9 |285.0| 8 136.6 |245.0 |10 |102.6 |243.3 | 9 | 91.2 
8 237.2 | 9 | 60.7 |291.3| 8 | 96.6 |296.4 |11 71.0 |271.1 | 9 | 52.3 |285.0 |12 | 80.1 
9 278.2 | 9 | 88.7 |278.2 |11 |113.4 |264.6 11 | 62.0 |235.0 | 8 | 55.9 |268.3 |12 | 123.9 
10 226.7 | 9 |102.7 |200.0 |10 | 62.5 (265.0 |10 65.0 |193.3 | 9 | 47.2 |222.5 | 8| 69.2 
11 237.8 | 9 71.1 [245.0 |10 | 67.3 |255.0 |10 | 68.7 |240.0 |10 60.0 |269.1 |11 | 66.8 
12 [215.6 | 9 | 54.8 226.7 | 9 | 47.1 |230.0 |10 | 55.7 [216.3 | 8 | 39.0 2767 | 9| 624 
stimuli. Using the sign test, the hy- Group E. When dissimilar materials 


pothesis was supported on all 12 
positions of the series. It may be said 
that assimilation to homogenous ma- 
terials is greater than to heterogenous 
materials with a shift in the direction 
of the anchor at least over 12 trials 
with additional stimulus materials. 
This result supports the results of the 
judgmental sequences of weight lifters 
on the same critical test stimuli. 
The second prediction that con- 
scious weighing as a task would be 
more influential in establishing a refer- 
ent than an irrelevant task or inci- 
dental task such as alphabetizing 
was not supported (p = .073). Inthe 
light of previous studies such as that 
of Rambo, this finding is confusing. 
One explanation suggests that there 
is an interaction effect here. Rambo 
used the same materials in his anchor- 
ing and test series, i.e., dots and cards, 
but in this experiment there is con- 
scious weighing with and without simi- 
lar materials. Since there is a greater 
similarity in activity for Groups D and 
E, there should be and there was a 
significant difference using the sign 
test (p — .003) in the test series in 
favor of larger medium judgments by 


are used, as in Groups B and C, it was 
found that there was no significant 
difference in the judgments of medium 
weights. Thus, it could be suggested 
that anchoring produced by similar 
tasks will vary with the degree of 
similarity of the stimulus materials 
such that anchoring will be most 
effective where both task and ma- 
terials are similar. With this type of 
analysis, then, the statement made by 
Johnson on students’ lifting a chair 
with no effect on the weight scale they 
had formed, is also explainable since 
the extraneous material was very 
dissimilar. 

The arrangement of the stimuli in a 
series in which2each of 12 weights was 
presented in each of the serial posi- 
tions lends itself to another type of 
analysis. The greatest displacement 
of the subjective scale should be noted 
on the first trial and gradual agree- 
ment with the objective scale should 
occur in the later serial positions. 
Tresselt and Volkmann (1942) have 
suggested that there is an area of dis- 
continuity between the fourth and 
fifth serial positions. Arbitrarily rank- 
ing the mean weight judged medium 
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for Positions 1-12 for each of the four 
experimental groups from heaviest to 
lightest resulted in a total average 
rank of 1, 2.5, 3, and 4.7 in that order 
for Serial Orders 1, 2, 3, and 4. After 
the fourth weight was lifted, the rank- 
ings followed an irregular order of 8.6, 
7.5, 8.0, 4.7, 8.0, 10.6, 8.0, and 9.7. 
In brief, the means over the first four 
medium judgments followed a slight 
descending direction in weight and 
then seemed to oscillate in an irregular 
fashion around the objective mean 
medium value (285.0) of the weight 
series. 

The results of this experiment seem 
to indicate that having had relatively 
heavy anchoring materials displaces 
the center of a new scale upward 
specially when both anchoring ma- 
terials and test materials are similar in 
structure. When the judgments of 
the anchoring materials are the same 
Or different from the judgments re- 
quired on the test materials, there may 
or may not be a displacement of the 
New scale. If both task and materials 
are the same in both anchor and test 
Situation, there will be a shift in the 

nter of the scale in the direction of 
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the anchor; but if the tasks are the 
same and the materials differ, there 
will be little or no effect. Within the 
limits of this experiment it seems that 
the effect of the anchor probably 
fades out on or around the fifth serial 
position of exposure to a new set of 
stimuli, 
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CULTURAL PRIMARIES AS A SOURCE OF INTERFERENCE 


IN SHORT-TERM VERBAL RETENTION ! 


KENNETH A. BLICK? 


Tulane University 


Kent-Rosanoff stimulus words were separated into 3 categories having 
different cultural probabilities of producing R, (primary), Rs (second- 
ary), or Ra-4 (sum of R; to Ra). The average cultural probabilities of 
R, for the 3 categories were .14, .32, and .73. A proaction design 
(Training A, Training B, and Test B) with 60 Ss per category was used. 
Training consisted of a list of 5 Ry words followed by a list of 5 corre- 
sponding R» words. After 120 sec. of vowel cancellation, Ss were 
tested for recall of the Rə words in the presence of the 5 stimulus words 
corresponding to the R; and Rs words of training. Intrusions of Ry 
words were found to be strongly related to the cultural probability of Ri. 
This finding was interpreted as the most solid empirical demonstration 
we have that language habits can interfere greatly in short-term 


retention. 


Underwood and Postman (1960) 
have proposed that learned verbal 
material is subject to extraexperi- 
mental sources of interference. Ex- 
traexperimental sources refer to the 
individual's preexisting language hab- 
its which operate through proaction to 
interfere with the recall of more re- 
cently learned material. Indeed, Un- 
derwood (1957) suggested earlier that 
the major portion of forgetting is 
produced by proacting agents. To 
date, however, there has been no 
single, solid empirical demonstration 
that language habits do interfere 
greatly with the retention of labora- 
tory learned words. 

The source of interference to be in- 
vestigated here was derived mainly 
from the work of Russell and Jenkins 
(1954), Bilodeau, Fox, and Blick 
(1963), and Blick (1963). The former 
published norms for response words 


1 This article is an abridged version of a 
dissertation submitted to the Graduate Sahas 
of Tulane University in partial aug 
the requirements for the PhD degree. k è 
author wishes to acknowledge the assistanc: 


of E. A. Bilodeau and his project on memory 


supported in part by the National Science 


Foundation. 


2 Now at Randolph-Macon College. 


given in discrete-free association to 
the 100 Kent-Rosanoff stimulus words. 
The stimulus word evoked a number 
of response words which were ar- 
ranged in a hierarchial order of fre- 
quency. Bilodeau et al. made an 
analysis of these hierarchies and calcu- 
lated the probability (p) relationships 
between the stimulus words and their 
response associates. The primary 
word was called Ri, the secondary 
word was called Rs, and responses 
from 3 to n were called R;., collec- 
tively. For example, CHAIR is an Ri 
and its p of occurrence to the stimulus 
TABLE is .84. The p of R; (roop) is 
.04, and p of R3_, is.12. Three cate- 
gories of words were discovered in the 
norms. The probability of evoking 
Ra, Re, or R.,, given the stimulus, is 
shown in parenthesis in Table 1. The 
categories have different patterns of 
probabilities. For example, Category 
A has low 2 (R3), low (Rə), and high 
P(Rs-n); Category B has medium 
p(Rı), maximum 5(R;), and medium 
p(Rs-n); and Category C has high 
p(Rı), low (R3), and low p(R3-n)- 
These patterns can be used to great 
advantage. 

If S was trained with a list of Ri 
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words followed by a list of correspond- 
ing Rə words, it would be possible for 
the Ri words to interfere and intrude 
during a recall session for Rs words. 
This would be especially true if 
stimulus words corresponding to the 
Ri and R, words were presented 
during recall as a means of abetting 
the intrusions. Reminding during 
recall is a general method of control- 
ling recall and has been discussed ex- 
tensively elsewhere (Bilodeau, Levy, 
& Sulzer, 1963). 

The amount of interference from 
primaries should vary in relation to 
the strength of p(R:). Accordingly, 
Category C should produce the great- 
est number of intrusions because its 
b(R;) is highest. Category B should 
Produce a moderate number of in- 
trusions, and Category A should 
produce the least number. The Ri 
Probabilities among the three cate- 
gories were employed as a basis for 
predicting the amount of proactive 
Interference produced by the cultural 
primary, 


METHOD 


Subjecls and apparatus.—Ninety students 
from Louisiana State University and 90 male 
Naval enlisted reservists from the New Orleans 
area were unsystematically divided into three 
groups. A group contained 30 students and 
reservists. Of the 90 college students, 
there were 57 males and 33 females, and the 
cmales had approximately equal representa- 
tion from group to group. 
he basic instrument was a 
made from an opaque cover s 
Sheets of ditto paper. The five 
arranged in the sequence: cover Pal 
training page, second training pase 
Cancellation page, and recall page. 
Groups and procedure.—The three groups 
represented the three categories of words. | 
Category contained 15 stimulus words which 
Vere unsystematically divided into three lists 
sa five. The three lists within categories were 
Presented by 20 Ss each. N 
Following "the proaction paradigm, the 
Ooklets contained five Ri words and five Re 
Words on the first and second training pasts, 


respectively. The recall page contained the 


5-page booklet 
heet and four 
sheets were 
ge, first 
vowel 
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corresponding stimulus words and five empty 
lines for answers. Note that the R, and Rs 
words of training represented the primary. and 
secondary associates to the stimulus words 
written on the recall page. 

Booklets and pencils were distributed to 
intact classes by assistant Es who served as 
proctors. To reduce the chances of S opening 
the booklet prematurely, the pages were 
secured by a paper clip. The lists were dis- 
tributed in an unbiased fashion to reduce the 
possibility of copying. Page turning was 
paced by E who directed S when to turn and 
read the brief instructions printed at the top 
of the pages. 

A total of 20 sec. per page was allowed for 
training, and on both pages the instructions 
read: “Study the words shown below." After 
training, Ss were given 120 sec. to cancel 
vowels. The recall page read: 


A short while ago you studied some words 
on page —. The words below may help 
you remember those words. Write down 
the words that appeared on page —. You 
must fill in every line. If you cannot 
remember, write in the first word that the 
printed word makes you think of. Do not 
look back to the previous pages. 


"The recall phase lasted 120 sec. 

The first and second training pages were 
labeled A and B, respectively, and the two 
blanks in the recall instructions were filled 
with Bs. The Bs were hand printed in red 
ink and were emphasized in E's oral instruc- 
tions so that S would not mistake the pre- 
scribed page. 

Each word recalled was classified according 
to one of six sources of interference. Three 
sources consisted of Position in the Kent- 
Rosanoff Hierarchy (Ri, Rs, or Rs-n). The 
other three sources included Same-List 
Intrusion (a correct word given in the wrong 
place on the recall page), Blank (where S 
failed to respond), and Remainder (none of 
the previous five). Only Ri and Rs words 
written in the place corresponding to their 
position in the training list were scored as 
other-list intrusions (Ri) or as correct re- 
sponses (Rz). When Ri or Rs appeared out of 
proper position, Rs., took precedence in 
scoring. This rule provided a conservative 
estimate of both other-list and same-list 
intrusions. 


RESULTS 


In Table 1 the 900 (180 X 5) pieces 
of recall data are classified by Cate- 
gory and Position in the Kent- 
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TABLE 1 


OBTAINED AND NORMATIVE RESPONSE 
PROBABILITIES CLASSIFIED BY 
CATEGORY OF STIMULATION 
AND SOURCE OF RECALL 


Position in K-R Hierarchy 


Category 
PR) 2(R2) D(Ri) 
A .21(.14)^ 51(.12) -18(.74) 
B .40(.32) 42 (.24) .07 (44) 
ie 55 (.73) 29(.05) .08(.22) 


a Russell-Jenkins normative probabilities. 


Rosanoff hierarchy. The value .21 
entered under (Rx) for Category A 
means that 21% of the 300 recalled 
words consisted of intrusions from the 
other list, the value .51 under p(R») 
means that 51% of the 300 words 
Were correct, and so on. 

The obtained probabilities show 
considerable agreement with the 
trends of the normative probabilities 
shown in parenthesis. The most im- 
portant result occurred under (Ri) 
where the incidence of primaries in- 


3.0 


20 


MEAN NO. OF INTRUSIONS PER S 


32 
PROBABILITY OF Rj 
Mean number of intrusions of 


es as a function of the 
for Categories A, 


Fic. 1. 2 

i i ondari 
primaries or sec S 
normative probability of Ri 


B, and C. 
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creased as the normative p(Ri) in- 
creased. The stimulation of recall 
has also raised the values of p(R:) 
over their original cultural incidence, 
though, Category B did not produce 
the maximum (Rs). Also, the data 
show a slight reversal under p(Rs—n) 
for Categories B and C, but a differ- 
ence of .01 is minor. 

The positive relationship between 
the mean number of intrusions of 
primaries (out of possible five) and 
the normative p(R1ı) is shown by the 
upper curve in Fig. 1. Analysis of 
variance for Category yielded a sig- 
nificant effect, F (2, 177) = 18.82, 
p «.01, and a comparison of the 
means by Duncan's range test in- 


TABLE 2 


FREQUENCY DISTRIBUTION OF Ss PRODUCING 
OTHER-LIST INTRUSIONS 


No. of Primaries Intruding 


Cate- S 
gory Total 
0 | H 2 3 4 5 
A 17 | 26 | 11 6| 0 0| 60 
B 16 9 | 10 | 11 7 7| 60 
C T 9 | 13 9 15 | 60 


dicated that the categories differed 
significantly from each other at the .05 
level.’ The two lower curves in Fig. 1 
provide valuable control comparisons 
since they represent the same three 
categories taken from the Bilodeau, 
Fox, and Blick study which trained 
with unrelated R} words and no pri- 
maries. The lowest curve, which rep- 
resents intrusions of R words, reveals 
the lack of relationship between the 
number of other-list intrusions and 
P(Ri;) in marked contrast to the 
present findings. Not only do these 
data show no relationship but also the 
absolute values of intrusions are quite 


3 The data for students and reservists were 
combined because all conclusions concerning 
intrusions as a function of p(R,) were the 
same for both groups. 
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low. This control comparison shows 
that other-list intrusions are low when 
the interpage relationship is low. The 
middle curve reflects the occurrence 
of primaries when Ss are not trained 
with primaries and shows the level of 
interference produced solely by pre- 
laboratory habits. The top curve, 
then, illustrates the combined effects 
of laboratory and prelaboratory habits 
on intrusions of R, at recall. Addi- 
tional information regarding intru- 
sions of R;'s is provided by a replica- 
ton of the present study which 
omitted the stimulus words during 
recall. With the stimulus words 
absent, the occurrence of R ;'saverages 
only 7% for the three categories. 

A breakdown of the production of 
Ris by Ss is presented in Table 2. 
lhe strength of the effect of (Rx) in 
Producing an R; is measured by the 
Change in the distribution from cate- 
gory to category. As p (R3) increases, 
there is a corresponding increase in 
the number of Ss giving more Ri's. 
For example, the percentage of Ss 
giving four and five Ri's is 0, 23, and 
40 for Categories A, B, and C, re- 
Spectively. Note also that 15 of 60 
Ss in Category C have recalled all the 
items of the incorrect list and have 
Offered them as items of the correct 
list, 

Approximately 90% of the re- 
Sponses were found tabled in the 
Russell-Jenkins norms; a result which 
demonstrates the generalizability of 
the Minnesota norms. Only a small 
Portion of the recall data (10% or less) 
Was explained as a same-list intrusion, 

lank, and remainder. These sources 
9f interference should be of far greater 
Importance under other conditions. 


DiIscussION 
These data, in accordance with the 
Previous study (Bilodeau, Fox, & Blick, 
1963 ; demor he power of cultur- 
nstrate the p 
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ally determined probability relationships 
to predict what other people will say in a 
retention test. The critical finding was 
that the number of intrusions of pri- 
maries increased as the cultural p(R,) 
increased. Deese (1959) was also able 
to predict verbal intrusions in immediate 
recall using cultural association strength 
as the independent variable. He used 
word lists composed of the first 12 terms 
(Ry through Ri in the present notation) 
and discovered that the stimulus word 
intruded in recall. The stimulus word 
was predicted from its percent occurrence 
as a response associate to the R, to Ri» 
words in a special side experiment using 
the Rı to Ris words as stimuli. In a 
related study, Rothkopf and Coke (1961) 
were able to predict the recall of Kent- 
Rosanoff stimulus words from word- 
association norms. They concluded that 
frequency of recall of a stimulus word in 
free recall was a direct function of the 
number of cues provided by other stim- 
ulus words in the list. The cue number 
of a stimulus word was defined as the 
number of stimulus words in the list to 
which the word in question occurred as 
an association response. 

It is desirable to place in perspective 
the magnitude of interference produced 
by the present technique. As much as 
55% interference was produced and in a 
substantial number of Ss intrusions of 
primaries were total (100%). In the 
study by Bilodeau, Fox, and Blick (1963) 
which trained with unrelated Rə words 
and no primaries, the maximum produc- 
tion of primaries was smaller (22% or 
less). Many studies have used unrelated 
nonsense syllables and have produced 
about 1% interference and related non- 
sense syllables have produced about 5% 
interference. It was for the reason that 
interference is typically small and diffi- 
cult to produce that the present study 
was undertaken. Since the present find- 
ings raise the obtained level of interfer- 
ence by an order of magnitude, an 
interference theory of forgetting becomes 
much more attractive than it was before. 

The procedure of opposing primaries 
and secondaries during training and 
stimulating recall by means of a common 
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stimulus term could provide a substitute 
technique for testing the unit-sequence 
hypothesis proposed by Underwood and 
Postman (1960). According to this 
hypothesis, frequency of usage is the 
major factor responsible for producing 
interference. The assumption is that 
more frequently used words will have 
stronger associative connections than 
less frequently used words, and retention 
for high-frequency words will be poorer 
because the associative connections will 
recover with rest and interfere with 
recall. Experiments specifically designed 
to test the unit-sequence hypothesis have 
produced rather bland results (Postman, 
1961, 1962; Underwood & Postman, 
1960). It now seems that the probability 
relationships between a word and its 
response associates provide a better 
procedure for producing interference. 
Frequency of occurrence of words may 
not be as important a variable as the 
probability relationships between stim- 
ulus and response terms. 

Finally, a question about the experi- 
mental design : Why was the first training 
page with its Ri’s used at all? A trial of 
practice with R,’s was used in order to 
increase the p of an intrusion of R, since 
the purpose of the study was to maximize 
intrusions of Ry. If the purpose had 
been to study the sole effects of pre- 
laboratory habits (Bilodeau, Fox, & 
Blick, 1963), no primary words would 
have been presented during training. As 
it stands, the design used is considered 
to enhance the probability of interference 
from preexperimental S-R habits. Re- 
questing S to free associate to a stimulus 
during the test if he cannot remember the 
word studied may be ruled out of bounds 
by some students of memory. The 
method is unique, for it is a blend of re- 
tention and free-association procedures. 
It possesses none of the virtues of either 
when used alone, and is not intended to 
replace their separate use; but in combi- 
nation tells us about the quantity and 
quality of misrecall under quantified 
conditions of modified free recall. In- 
structing S to use the recall stimulus 
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leads to predictable patterns of cultural 
primary intrusions; exposing the prima- 
ries during training augments such intru- 
sion patterns. These misrecalls may be 
indexes of the forgetting process itself or 
they may only be responses substituting 
after forgetting has occurred; the present 
design does not distinguish between these 
two alternatives. Most importantly, the 
design does show how particular misre- 
calls can be regulated in large quantities. 
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SOME EFFECTS OF CON 
BRIGHTNES 


PHYLLIS W. BERMAN 


University 


Simultaneous 


TOUR ON SIMULTANEOUS 
S CONTRAST! 


AND H. W. LEIBOWITZ? 


of Wisconsin 


brightness contrast was measured as a function of: (a) the 


orientation of a test object, shaped as a figure 8, on a half light, 
half black surround, (b) type and width of a contour separating the 


ligi 
brightness of th 
light surround. Su 
when the figure 8 was pr 
different brightness than w. 
when figure h 


than when a dividing | 
tour between halves was increased. 
of the contribution of the border 


complex stimulus configurations. 


A gray stimulus viewed against a 
white background will appear darker 
than when viewed against a black 
background. This is an example of 
Simultaneous brightness contrast, à 
Phenomenon which can be interpreted 
in terms of neural inhibition at the 
retinal level following the implications 
of a number of recent electrophysio- 
logical experiments (Ratliff, Hartline, 
& Miller, 1963). Differences in excita- 
tion are exaggerated by interaction, 
the magnitude of the difference de- 
Pending in a systematic manner on 
the luminance difference, size of back- 
&round and test-fields, and the separa- 
tion between the fields (Diamond, 
1960, 1962; Heinemann, 1955). 

There are, however, some aspects 
9f the subjective contrast effect, such 
as its relation to contour, which do not 

t readily into a theoretical approach 
ased simply on neural inhibition. In 
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alves were moved apart, l 
ine separated the halves, (c) as width of the con- 


ure halves on the divided background. 48 adult Ss matched the 
e figure half on the dark background with that on the 
bjective contrast was significantly greater: (a) 
esented with its rings on backgrounds of 
hen each ring lay on both backgrounds, (b) 


each into its own surround, rather 


. The results are discussed in terms 


to subjective contrast obtained with 


a classical demonstration, Wundt 
showed that a gray paper placed over 
the border between a black and white 
area will tend to appear uniformly 
gray despite the expected effects of 
simultaneous contrast (Osgood, 1953). 
If however, a contour is added to the 
gray stimulus, so as to separate the 
portion viewed against the black from 
that viewed against the white back- 
ground, the expected contrast effect 
is manifested. Helson (1943) dis- 
cussed this phenomenon in terms of 
the “assimilation” of the contrast 
effect within a common border. 
Koffka (1935) theorized that shape 
acts as a “force of cohesion" which 
plays a role in the appearance of 
contrast. He suggested that if a gray 
object is used which has two clear sub- 
divisions, degree of contrast will vary 
with its orientation on the contrast- 
inducing background. These phe- 
nomena suggest that subjective con- 
trast depends, not only on luminance 
differences, but also on the presence 
of an identifiable contour or shape 
with respect to the stimulus. 

The present experiments were de- 
signed to determine quantitatively 
the magnitude of the subjective con- 
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trast effect as a function of the width 
of the contour separating the two 
halves of a figure 8 test object, and 
as a function of the orientation of the 
test object on the black and white 
background. Such data should be of 
value in evaluating the importance of 
contour as a contributing variable to 
the phenomenon of simultaneous 
brightness contrast. 


APPARATUS 


The stimulus consisted of a figure 8 cut 
from neutral density film (D = 0.7) and 
mounted flush against a piece of vertically 
positioned milk glass.  Illuminance was 
provided by a 300-w., 110-v. ac projector 
system focused on the opposite side of the 
milk glass. The right half of the stimulus (see 
Fig. 1) was positioned within a clear rectangle 
and appeared less luminous than its back- 
ground. The left half of the figure was sur- 
rounded by opaque black paper and appeared 
more luminous than its background. Except 
for the figure 8 and the light background sur- 
rounding the right half of the figure, there 
were no other stimuli in the visual field. The 
remaining portions of the milk glass were 
covered with opaque black paper. 

The S was able to match the two halves 
of the figure 8 by means of a pair of polaroid 
filters. The polarizer, made from sheet 
polaroid, was placed over the figure half lying 
on the black background. The analyzer, a 
similar piece of polaroid mounted in glass, 
was placed in the eyepiece through which 
S observed monocularly with the natural 
pupil. The S rotated the analyzer until the 
two figure halves appeared equal. ‘The 


AND H. W. LEIBOWITZ 


matched luminance value of the left half of 
the figure could be calculated from the 
angular position of the analyzer and the 
calibration data of the polaroid system. 

The figure 8 stimulus was symmetrical 
horizontally and vertically with an outside 
diameter of each ring of 21 in. The rings were 
$ in. width. The viewing distance was 34.5 
in. at which the maximum dimensions of the 
figure 8 were 3.72 X 7.44? of arc. The bright 
background against which the right half of 
the figure was viewed subtended a visual 
angle of 13.7 X 5.8? of arc. 

The magnitude of the contour was varied 
by presenting the stimulus with no inserts 
between the two halves of the figure, and by 
placing vertical wires or strips of the following 
widths symmetrically over the center of the 
figure: .005, .01, .02, .04, .08, .16, .32, and 
-64 in. "These dividers corresponded to visual 
angles of 0.5, 1.0, 2.0, 4.0, 8.0, 15.9, 31.9, and 
63.7’ of arc. The luminance of the right half 
of the test figure was constant at log 2.0 mL. 
and the background luminance was constant 
at log 2.7 mL. 


EXPERIMENT | 
Method 


Two groups of 12 Ss each, volunteers with 
normal vision from the elementary psychology 
classes at the university, first made four 
practice matches using a square test object 
and a dividing strip subtending 63.7’ of arc. 
Group A was then presented the figure in the 
vertical position, and Group B in the hori- 
zontal position (see Fig. 1). The nine experi- 
mental conditions for each group, i.e., eight 
dividers and no divider, were counterbalanced 
by a 9 X 9 Latin-square design. Subjects 
10, 11, and 12 were presented with the same 
sequence as Ss 1, 2, and 3 since ordinal 


TABLE 1 


AVERAGE LOG MATCHED LUMINANCE IN MILLILAMBERTS OF Gnours A, B, C, AND D as A 


FUNCTION or WIDTH DIVIDER OR SEPARATION BE" 


TWEEN FIGURE HALVES 


Width Divider in Minutes of Arc 


Group T = o f 
0 0.5 1.0 2.0 "EY &0 "m m am 

v g| o97| 095| 078! oso| 065! 031} 0:3 | 015 
B 0.08 | —0.20 | —0.15 | —0.10 | —0.22 | —0.24 | ~030 | -0:51 | Zon 
C with separation | 0.92 pee) aan | -021 
C with divider, 0.92 | : 
D with separation | 0.12 | 
D with divider 0.12 | 


and B are from Exp. I. Groups 


Note.—Groups A 


C and D are from Exp. II. 


CONTOUR AND SIMULTANEOUS BRIGHTNESS CONTRAST 


o o = 
[o] oa Oo 


LOG MATCHED LUMINANCE (mL) 
| 
o 
a 


[*] 0.5 | 


to 
tn 


2 4 8 l6 32 64 


DIVIDER WIDTH (MINUTES OF ARC) 


Fic. 1. Log matched lumin [ 
black background matched with the right h 


ground luminance was constant at log 2.7 mL. 
The data in the upper curve, 


Was constant at log 2.0 mL. 


With vertical orientation of the configuration [Insert A 
lower curve, indicated by crosses, were obtained with 


position, as indicated by : of variance, 
Was not a significant variable. The 5 made 
four matches at each of the experimental 
conditions. "- 

Each S was given a period of preliminary 
dark adaptation after which instructions were 
given regarding the nature of the task and 
the method of rotating the polaroid in the eye- 
Piece so as to produce a match between the 
two halves of the test object. The Ss were 
instructed not to try to match any one section 
of the figure with any other section, but to 
make the best overall match possible. 


Results 


The data are presented in Table 1 
and plotted in Fig. 2 as the log 
Matched luminance as a function of 
Width of the divider for the two 
Orientations used. Table 2 is a 
Summary of an analysis of the vari- 
ance associated with these data. All 


ance of the left half of a figure 8 test object viewed againsta 
alf viewed against a light background. 


(The back- 
The luminance of the right half of the figure 
indicated by circles, were obtained 
; no divider present]. The data for the 
a horizontal orientation [Insert BJ.) 


terms tested in this analysis are sig- 
nificant at the .001 level. 

Even in the absence of any divider 
between the two halves of the test 
field, indicated by the zero position 
on the abscissa axis of Fig. 2, con- 
siderable contrast existed. Whereas 


TABLE 2 
ANALYSIS OF VARIANCE OF MATCHING 
SconEs or GROUPS A AND B UstxG 
NiNE WiprR. DIVIDERS 


Source | MS | n 


Between Ss 
Figure position (FP) 1 
or ( 22 
Within Ss 
Width divider (WD) 
WD X FP 


427,645.0 
22,339.1 


e 5.001. 
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the luminance of the portion of the 
figure viewed against the white back- 
ground was log 2.0 mL., it was 
matched by the portion viewed against 
the black background at an average 
value of log 1.1 mL. for the vertical, 
and log 0.1 mL. for the horizontal 
orientation. 

The effect of introducing the divider 
is considerable, amounting to factors 
of approximately 16 to 1 for the 
vertical, and 5 to 1 for the horizontal 
orientation. As dividers of greater 
width were introduced the amount of 
contrast was systematically increased, 
indicated by the increasingly lower 
luminance values of the left half of the 
figure required to produce a match 
between the two halves of the test 
figure. This effect was exhibited with 
the test object in both orientations, 
but there was significantly more con- 
trast for all divider widths with the 
test figure oriented horizontally than 
with the vertical orientation. 

Increasing the width of the divider 
had a greater effect with the test figure 


TABLE 3 
ANALYSIS OF VARIANCE OF MATCHING SCORES 
or Grours C anp D wrrH .160-, .320-, 
AND .640-IN. DIVIDERS AND SEPARATION 
BETWEEN HALVES 


Source df MS F 
Figure position (FP) 1 | 99,487.6 | 15.93*** 
Ss/FP 22 | 6,244.6 | 33.97*** 
Width between halves (W) | 2 | 15,071.7 | 32.76*** 
Beton as and sepa- | 4 | 47,844.5 | 16.35% 
FP X D-S 1 | 19,716.9 | 18.07*5* 
WwW X FP 2 745.9 | 1.62 
w x D-S 2| 1,683.5 | 9.16% 
w XFP XD-S 2 | 5,968.4 | 32.47*** 
S3/FP XW 44 460.1 | 2.5095 
Ss/FP X D-S 22 | 1,091.1] 5.94595 
Ss/FP X W X D-S 44 183.8 
Total 143 


~ Note.—Ss/FP is the error term for FP. Ss/EP 
x Ws the SS/PP etm for Wand for W X FP. Ss/FP 
X D-S is the error term for D-S and for FP X D-S. 
SS/FP XW X D-S is the error term for all other 


RED < ‘001. 
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in a vertical position. This can be 
seen in Fig. 1 and is reflected by the 
significant interaction between Width 
Divider and Figure Position in the 
analysis of variance presented in 
Table 2. An extended Alexander 
trend test was used to determine 
whether the increase in contrast was 
significantly more linear with the test 
figure in the vertical position than 
with it in the horizontal position 
(Grant, 1956). The interaction term 
was broken into the component terms, 
Between Group Slopes, Quadratics, 
Cubics, and Deviations from the Ex- 
pected. Of these, only Between 
Group Slopes is significant, F (1, 22) 
= 8.86, p < .01. 


EXPERIMENT II 
Method 


In Exp. I the dividing strip covered part 
of the test object so that an increase in the 
width of the divider decreased the effective 
area of the test object as well as the back- 
ground. Since area is an important variable 
in contrast (Diamond, 1960, 1962), a control 
experiment was conducted in which the two 
halves of the test object were physically 
separated so that, while the width of the 
"contour" between them increased as in 
Exp. I, their areas, as well as the area of the 
light background, remained constant. Two 
additional groups of 12 Ss each were tested 
under conditions of no divider, and with 
dividers as well as separations of .16, .32, and 
.64 in. Group C was presented with the test 
object in the vertical position, and Group D 
with it in the horizontal position. 

The order of presentation of the seven 
treatment conditions was determined by a 
7 X7 Latin-square design with the same 
design being repeated for both groups. The 
results of a t test between the most divergent 
scores indicated that ordinal position was not 
significant, so that the five additional Ss in 
each group were presented with the same 
order of treatment conditions as Ss 1, 2, 3, 4, 
and 5. 


Results 


The data are presented in Table 1 
and Table 3 presents an analysis of 
the variance of these data. Since only 
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one measurement per S represents the 
zero level of both the Divider and 
Separation conditions, only the scores 
for the .16, .32, and .64 widths were 
included in this analysis. 

As in Exp. I, both Width and 
Figure Position are significant sources 
of variability. 

The overall effect of separation of 
the halves of the test figure was to 
produce significantly more contrast 
than was perceived when a dividing 
strip was used to create a contour be- 
tween the figure halves. The signifi- 
cance of the interaction term, Figure 
Position X Divider Separation, re- 
flects the fact that this effect differed 
for the two orientations of the test 
figure used. Separation produced 
more contrast when the test figure was 
in the vertical position. For the 
horizontal orientation, either proce- 
dure produced similar results. . 

In the analysis summarized in 
Table 3 Width X Figure Position is 
insignificant but the second-order 
interaction, Width X Figure Position 
X Divider Separation, is significant. 
Analyses of variance Were computed 
separately for the Separation. condi- 
tion and for the Divider condition an 
data for the zero level of each was 
included. Width X Figure Position 
was significant only for the Separation 
condition, F (3, 66) = 4.60, P < ot. 
This finding differs from the result in 
Exp. I, that Width Divider X Figure 
Position is significant; but with nine 
instead of three widths dividing strips 
used. " 

Width is a significant variable 
in both the Separation condition, 
F (3, 66) = 62.83, p < 001; and the 
Divider condition, F (3, 66) = 36.39, 
b < 001. However, the increase in 
contrast perceived with cap roman 
widths between figure halves differed 
in the Separation and Divider condi- 
tions, This is indicated by the signifi- 
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cant interaction, Width X Divider 
Separation, in Table 3. 


Discussion 


The results of the present experiments 
provide quantitative confirmation of the 
phenomenal impressions derived from 
early demonstrations. The contribution 
of border to the subjective contrast effect 
is clearly demonstrated by the marked 
change in the magnitude of contrast re- 
sulting from the introduction of thin 
dividing strips between the halves of the 
test object. Of particular interest is the 
influence of the smallest divider used in 
Exp. I, which subtended only 30” of arc. 
This value corresponds to the angle 
subtended by individual cones in the 
center of the human fovea, and is 
approximately the same order of magni- 
tude as the threshold for grating acuity 
(Leibowitz, 1953). This fine line is seen 
here to be adequate to increase contrast 
for both orientations of the figure. It is 
highly unlikely that the effect of a line 
of this width, which represents approxi- 
mately 0.2% of the total width of the 
test object in the vertical orientation and 
0.1% of the total width in the horizontal 
orientation, can be ascribed to changes 
in luminance or area relationships. The 
finding that there is not less but more 
difference in brightness between figure 
halves in the Separation condition in- 
dicates that the findings cannot be 
attributed to changes in area resulting 
from introduction of dividing strips. A 
more plausible explanation would be in 
terms of an “equalizing mechanism” such 
as that described by Fry (1948; Fry & 
Bartley, 1935), which is assumed to be 
responsible for the assimilation of differ- 
ential contrast effects throughout the 
whole figure or, when the divider is 
introduced, within the figure halves. 

When there is no divider introduced, 
the shape of the figure itself may have an 
influence similar to the effect of such a 
contour. The extent to which this is 
manifested depends on the position of 
the test object on the divided back- 
ground. Because the figure 8 test object 
is of such a shape that two component 
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parts are identifiable, the position of 
these two parts on the divided back- 
ground is important. More contrast is 
observed between the figure halves 
viewed against different backgrounds 
than when the rings of the figure each 
lie on both the black and white back- 
grounds, as in the vertical orientation. 

Of interest is the observation that con- 
siderable subjective contrast exists in the 
absence of any dividing line between the 
two halves of the figure on backgrounds 
of different brightness. The effect is 
probably not noticeable in the more 
familiar demonstrations of this phe- 
nomenon using paper stimuli. With 
such stimuli contrast appears to be re- 
duced by the presence of the visible 
microstructure of the paper surface, and 
it may be increased by observing test 
objects through a diffuser such as tissue 
paper (Woodworth & Schlosberg, 1955). 
In the present stimulus arrangement 
transmitted light was utilized and no 
microstructure was visible. In addition 
the luminance differences among the test 
object and the light and dark back- 
grounds with transmitted light are far 
in excess of that available with paper 
stimuli. Thus, the present arrangement 
results in far more favorable conditions 
for contrast than ordinarily exist. The 
familiar observation that a divider is 
necessary for contrast to be manifested 
seems to be a special case resulting from 
the use of paper stimuli. 

In the present experiments the role of 
the shape of the test object and of fine 
dividing lines has been demonstrated. 
It follows that simultaneous contrast is 
not a simple function of luminance and 
spatial variables. This does not imply 
that these variables are not fundamental 
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to contrast, but rather that additional 
concepts are needed to fully explain 
subjective contrast with more complex 
stimulus configurations. 


REFERENCES 


DrAMOND, A. L. A theory of depression and 
enhancement in the brightness response. 
Psychol. Rev., 1960, 67, 168-198. 

DIAMOND, A. L. Simultaneous contrast as a 
function of test-field area. J. exp. Psychol., 
1962, 64, 336-345. 

Fry, G. A. Mechanisms subserving simul- 
taneous brightness contrast. Amer. J. 
Optom., 1948, 45, 1-17. 

Fry, G. A., & BagrLEY, S. H. The effect of 
one border in the visual field upon the 
threshold of another. Amer. J. Physiol., 
1935, 112, 414-421. 

Grant, D. A. Analysis of variance tests in 
the analysis and comparison of curves. 
Psychol. Bull., 1956, 53, 141-154. 

HEINEMANN, E. G. Simultaneous brightness 
induction as a function of inducing- and 
test-field luminances. J. exp. Psychol., 
1955, 50, 89-96. 

Hetson, H. Some factors and implications 
of color constancy. J. Opt. Soc. Amer., 
1943, 33, 555-567. 

KorrKa, K. Principles of gestalt psychology. 
New York: Harcourt, Brace, 1935, 

Lersowitz, H. Some observations and theory 
on the variation of visual acuity with the 
orientation of the test object. J. Opt. Soc. 
Amer., 1953, 43, 902-905. 

Oscoop, C. E. Method and theory in experi- 
mental psychology. New York: Oxford 
Univer. Press, 1953. 

RATLIFF, F., HanTLINE, H. K., & MILLER, 
W. H. Spatial and temporal aspects of 
retinal inhibitory interaction. J. Opt. Soc. 
Amer., 1963, 53, 110-120. 

Woopworth, R. S., & ScutospErRG, H. Ex- 
perimental psychology. New York: Holt, 
1955. 


(Received January 17, 1964) 


Journal of Experimental Psychology 
1965, Vol. 69, No. 3, 257-262 im) 


PREEXPOSURE TO VISUALLY PRESENTED FORMS 


AND NONDIFFERENTIAL REINFORCEME 


IN PERCEPTUAL LEARNING! 


LARRY C. KERPELMAN ? 


University of Rochester 


An experimental clarification of previous preexposure experiments which 
seemed to support the differentiation viewpoint of perceptual learning 


was undertaken. 
sparse environment. 
experimental Ss. 
stimuli was varied, and the 


79 albino rats were reared from birth in a visually 
Circles and triangles hung in the cages of the 
Nondifferential food reinforcement of these cage 
resultant effect on subsequently learning to 


discriminate between them was examined. Discrimination performance 


of the main experimenta 


1 groups confirmed the hypothesis that non- 


differential reinforcement played a role in previous preexposure studies 


of perceptual learning, t 
sults for supplementary groups ! 


hus supporting the enrichment approach. 
necessitated expansion of that approach 


Re- 


to include possible reinforcing effects of novelty. 


Several studies in which rats were 
exposed to stimuli (usually circles and 
triangles) from birth and later learned 
to discriminate between them better 
than nonexposed Ss (Gibson & Walk, 
1956; Gibson, Walk, Pick, & Tighe, 
1958; Gibson, Walk, & Tighe, 1959; 
Walk, Gibson, Pick, & Tighe, 1958, 
1959) have been used to support the 

differentiation” view of perceptual 
learning. That view (Gibson & Gib- 
Son, 1955) holds that experience with 
Stimuli that are to be discriminated is 
all that is necessary in certain situa- 
tions for perceptual learning to occur. 
It is not necessary to associate drive 
reduction, memory, or other factors 
With the stimuli. The Gibsons have 
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named the opposing viewpoint, which 
subsumes association, reinforcement, 
and drive-reduction theories, *en- 
richment.” The differentiation inter- 
pretation of the experiments cited 
above (which will hereafter be referred 
to generically as the Cornell studies) 
is exemplified by the statement: “The 
theories which best seem to fit the 
results all emphasize experience per 
se rather than reinforced experience 
[Walk et al., 1958, p. 487]." 

An alternative explanation of those 
results is nevertheless plausible. In 
the Cornell studies, experimental Ss 
received food, water, and other rein- 
forcement in the presence of the cage 
forms to which they were preexposed. 
Hence the stimuli were nondiffer- 
entially reinforced. Spence (1936) 
postulated that nondifferential rein- 
forcement of, for example, two stimuli 
raises the excitatory tendency of both. 
On introducing differential reinforce- 
ment (as in the discrimination task), 
enhancement of learning would be 
expected in Ss receiving prior non- 
differential reinforcement (the pre- 
exposed Cornell experimental Ss) as 
compared to those receiving little or 
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no prior reinforcement—differential or 
nondifferential—contiguous with the 
stimuli (the nonexposed Cornell con- 
trol Ss). By this formulation the pre- 
exposed Ss in the Cornell experiments 
might be expected to learn the sub- 
sequent discrimination faster than the 
controls because, at the beginning of 
discrimination training, the stimuli 
for the former group were at a higher 
level of excitatory tendency than they 
were for the latter group. Because 
the Cornell studies have important 
implications for perceptual theories, 
the present study was undertaken. 
The effect of nondifferential reinforce- 
ment in a preexposure situation was 
examined by varying the presumed 
strongest reinforcement, food, within 
conditions replicating, in all essentials, 
those of the Cornell studies. 


METHOD 
Subjects 


The Ss were 79 Charles River CD albino 
rats, 40 in Replication 1 (R1) and 39 in 
Replication 2 (R2). These were selected from 
a pool of 105 rats born to 10 pregnant females 
from the Charles River Breeding Laboratories. 
The 55 rats providing the R1 S pool were all 
born within a span of 5 hr.; the 50 rats 
providing the R2 S pool were all born 47 days 
later within a span of 6 hr. 


Rearing 
The Ss were born and reared in cages made 
of 3-in. wire mesh, the cages placed in white 
compartments dimensionally identical to 
those of Walk et al. (1958). The backs of 
the compartments were flush with the white 
wall of the room, and the front was covered 
by a white oilcloth curtain hanging 8 in. from 
the edge of the compartments. Water was 
available ad lib. Thus the visual environ- 
ment of each S during the experiment con- 
sisted of the compartment, the cage, the water 
bottle, the cage mates, the food tray for 
certain periods each day, and, for the experi- 
mental Ss, the cage stimuli for certain periods 
each day. Each of the R1 cages was lit from 
7:00 A.M. to 6:00 P.M. daily by a shielded 10-w. 
bulb on the compartment ceiling; R2 cages 
had no such arrangement but received normal 
room- and daylight during the same period, 
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Approximately 1 day after birth the five 
litters of each replication were cross-fostered, 
each mother receiving an equal number of 
pups. Each group thus formed was assigned 
to one of the four experimental conditions or 
to the control condition. The Ss nursed 
ad lib. until age 21 days when they were 
weaned and sexed. Each sex within each 
condition was placed in a separate cage, three 
to seven Ss per cage. A 12-hr. feeding 
schedule was begun, and, for the experimental 
groups, stimulus-placement procedures (dis- 
cussed below) were initiated. Except for 
cross-fostering and weaning, Ss were not 
handled during rearing. 

After weaning, Ss were fed twice daily with 
a mash of powdered laboratory meal mixed 
with water. The food was placed in white 
food trays that were hung from the cage wall 
for feeding and then removed after 1 hr. The 
first feeding was at approximately 8:00 A-M., 
the second, at approximately 7:00 P.M. 

Four stimuli, two circles and two triangles, 
hung in the experimental Ss’ cages, one 
against each wall of each cage, for a period of 
time daily (discussed below). They were 
made of flat j;-in. brass painted flat black. 
The circles were 3 in. in diameter, the tri- 
angles, 3} in. per side. Soldered to the top 
of each form was a length of wire by which the 
stimuli were hung in the cage. They were 
rotated at each feeding so that a circle and a 
triangle hung against each wall each day, 
and, specifically, a circle or a triangle was 
alternately at the wall where Ss received food 
each day. 


Experimental Conditions 


Stimulus placement procedures were intro- 
duced at age 21 days because the ad lib. 
nursing allowed up to that time precluded 
control of feeding and stimulus perception— 
especially when Ss' eyes opened at age 16 days 
—as required by the experimental conditions. 
These procedures continued throughout the 
experiment. For Group E1 (22-Hr. Reduced 
Reinforcement), the four cage stimuli were 
present 22 hr, daily, being removed for 2 hr. 
daily at each 1-hr. feeding period. This grouP 
provided the needed conditions to discern the 
factors involved in the improved discrimina- 
tion performance of preexposed groups in the 
Cornell studies. For Group E2 (22-Hr. 
Nondifferential Reinforcement), the cage 
forms were present 22 hr. daily, being removed 
twice daily at separate 1-hr. nonfeeding times 
to equate total amount of preexposure with 
that of Group E1. Since the forms were pre- 
sent during feeding, this group was comparable 
to the typical Cornell experimental group. 
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The Ss of Group E1 might have been open 
to reinforcement factors other than the 
manipulated one, food, such as social rein- 
forcement, secondary reinforcement, and 
primary reinforcement of the thirst drive. 
Because these might have confounded the 
food reinforcement variable, supplemeatary 
groups were instituted which were expected 
not to be as prone to such influences. Group 
El’ (2-Hr. Reduced Reinforcement) was 
exposed to the forms for 2 hr. daily at separate 
l-hr. nonfeeding times. Group E2' Q-Hr. 
Nondifferential Reinforcement) was exposed 
to the cage forms only for the 2 hr. daily that 
feeding occurred. 

For Group C (Control), no cage forms were 
present during the experiment. This group 
was comparable to the Cornell studies’ control 
groups. x 

In Groups E2 and E2’, where the stimuli 
were present during feeding, a stimulus was 
placed directly over the food tray. The mash 
was of such consistency that it could not be 
carried from the tray and thus had to be 
eaten in close proximity to the stimulus. 

In an attempt to control a possible con- 
founding factor, viz. differences in activity 
level at the time of stimulus change, stimuli 
Were removed from E2 cages and placed in E1 
cages the hour before feeding as often as 
Possible (47% of the time in R1 and 30% in 
R2). This was done to equate activity level 
of these groups at the time of stimulus change 
with that of Groups El and E2', where 
stimulus change occurred at feeding, a period 
of high activity (Reid & Finger, 1955). It 
Was expected that all four experimental 
groups would thereby "notice" the stimulus 
Change equally because of approximately 
equal activity level. When the stimuli were 
removed from E2 cages and placed in p. 
Cages the hour before feeding, an interval 0 
approximately 2 min. was interposed between 
removal of the stimuli from E1’ cages and 
their replacement in E2 cages, and the placing 
9f the food trays in the cages. This was done 
to dispel the signaling effect that stimulus 
Change may have had: the limited memory 
capacity of the albino rat (Honzik, 1931) 
made it seem unlikely that a relationship 
between stimulus replacement (E2) or re- 
moval (E1’) and appearance of food would be 
made, 

A single 1-hr. daily feeding was begun at 
age 86 days in preparation for the circle- 
triangle discrimination learning task at age 

0 days. This 24-hr. schedule was continued 
Until the end of the experiment, and appro- 
Priate adjustments were made in stimulus 
Placement procedures during this period. 


Discrimination Training 


The apparatus was a modified Grice (1948) 
discrimination apparatus as described else- 
where (Gibson & Walk, 1956). Pretraining 
and discrimination training followed the 
procedures described in Gibson and Walk 
(1956). The S could obtain a pinch of wet 
mash fromone of two stimulus panels—a black 
circle, or a black triangle, on a white back- 
ground—by pushing open the correct stimulus 
door. The incorrect stimulus panel was 
baited but locked. The only differences be- 
tween the Gibson and Walk (1956) procedure 
and the present one was that a 60-sec. inter- 
trial interval was not adhered to, and 5s were 
run for 40 days if they had not reached 
criterion, rather than 15 days. (See Kerpel- 
man, 1963 for further procedural details.) 


RESULTS 


The 79 Ss chosen from the pool of 
105 animals were selected before dis- 
crimination training to give as equal 
a distribution as possible within condi- 
tions, replications, and sexes. The re- 
sultant N was 16 in each group, except 
for E1 in which there were 15 Ss due 
to inequality of sexes. For the statis- 
tical analysis, however, the missing S 
in Group E1 was “replaced” by adding 
the appropriate mean subgroup score. 
Means and SDs of the four perform- 
ance measures for al] subgroups? are 
given in Table 1. Since the sub- 
sequent analyses of variance did not 
reveal a significant Conditions X Rep- 
lications interaction, the scores given 
and the analyses of differences be- 
tween them are based on both rep- 
lications combined. 

A summary of the 5 X 2 X 2 anal- 

3 A mean and SD table for the subgroups 
of each replication (Table A), four analysis 
of variance summary tables (Tables B 
through E), and a t-test summary table 
(Table F) have been deposited with the 
American Documentation Institute. Order 
Document No. 8182 from ADI Auxiliary 
Publications Project, Photoduplication Serv- 
ice, Library of Congress, Washington, D. C. 
20540. Remit in advance $1.25 for microfilm 
or $1.25 for photocopies and make checks 
payable to: Chief, Photoduplication Service, 
Library of Congress. 
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TABLE 1 
MEANS AND SDs or THE TREATMENT CONDITIONS FOR R1 AND R2 COMBINED 
Measures 
Groups Trials to Criterion Initial Errors | Repetitive Errors Total Errors 
M | SD M p | M | SD M SD 
E1 267.13 | 82.77 108.25 40.21 | 136.81 82.02 245.06 | 119.94 
E2 188.63 90.12 71.25 38.63 82.50 55.40 153.75 83.98 
C 249.69 63.37 97.69 30.84 108.88 49.60 206.56 69.03 
Et’ 205.25 70.75 75.25 31.80 80.44 53.42 155.69 83.06 
E2' 189.31 42.45 70.19 19.43 78.50 30.91 148.69 46.75 


yses of variance (Conditions X Posi- 
tive Stimuli X Replications) per- 
formed on each measure is given in 
Table 2. The level of statistical 
significance used throughout was 
p «.05. In interpreting the results 
it should be noted that the four meas- 
ures are related: significant results 
across all four are not, therefore, in- 
dependent failures to support the null 
hypothesis. 

A significant Conditions main effect 
is readily apparent. A Replications 
main effect is strongly suggested, 
manifested by all groups in R1 per- 
forming better than their counterparts 
in R2, with the exception of Group 


for which circle was positive to make 
more errors, except, again, for Ss 
in Group E2'. Conditions X Sexes 
X Replications analyses of variance, 
with cell frequencies adjusted for 
equal N, were done on all measures. 
The only significant findings were a 
Conditions main effect for all meas- 
ures and a Sexes X Replications inter- 
action for two measures, the interac- 
tion most likely reflecting confounding 
of sex with time of day of running. 
In light of the consistently signifi- 
cant Conditions main effect, / tests of 
differences between condition means 
relevant to the research hypotheses 
were performed, the estimated error 


E2'. A Stimulus main effectisslightly variance being based on the error 
suggested, shown by a tendency for Ss mean square of the Conditions 
TABLE 2 
ANALYSES OF VARIANCE OF FOUR PERFORMANCE MEASURES 
E Trials to Criterion Initial Errors Repetitive Errors Total Errors 

Source df 
MS F MS F MS F MS F 
ditions (C) 4 20983 | 3.92** | 4816 ga 10208 | 3.09* | 28721 | 4.07** 
Sea (S) 1 | 16245 | 3.04 | 3432 |3.15 | 19035 | $76* | 38632 | 5.47* 
Replications (R) 1 8736 | 1.63 5746 | 5.27 15569 | 4.71* | 40231 | 5.70* 
CxS 4 | 9396 |1.76 | 1647 | 1.51 4237 | 1.28 | 10432 | 1.48 
CXR 4 | 4476| «1 | 1106 | 1.01 2395 | <1 | 5468| <1 
SXR 1 13| «1 106 | «1 719 | <1 | 1378 | <1 
CXSXR 4 4163 | «1 633 ES 60| <1 1036 | <1 
Within 59» | 5351 1091 3305 7061 
Total 78 
caer df subtracted because of “replaced” E1 S. 


1 u! 
*p < .05. 
1p oL 
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TABLE 3 


¿SULTS OF / TESTS RELEVANT TO THE RESEARCH HYPOTHESES 


IN TERMS OF STATISTICAL SIGNIFICANCE 


2-Hr. Groups 


Measure PPA FN | 
Mesure Differentiation 


Enrichment 
Prediction 
E2 >E1> C^ 


Enrichment 
" . Prediction 
SE sÉ 


Differentiation 
Predicti 


E2’ = 


‘Trials to criterion 
Initial errors 

Repetitive errors 
‘Total errors | 


| 22-Hr. Groups 
| 
| 
| 
| 


^ Due to the possible operation nco: 
the absence of food, exact a priori predictions of 
framework, 
, „è Sign > indicates 
significantly, although they Y di 
differ significantly from C, nor C from E1, yet 


may have differed non 


X Stimuli X Replications analyses. 
The results, in terms of statistical 
significance, are summarized in 
Table 3. 
DISCUSSION 

22-Hr. Groups 

The 22-hr. groups were comparable to 
the typical Cornell experimental group. 
The only difference in experimental 
conditions between Groups El and E2 
was that, for the latter, feeding occurred 
in the presence of the cage forms; 
Whereas for the former, the stimuli were 
absent during feeding. Yet E2 per- 
formed on the discrimination task signifi- 
cantly better than E1 and, on two meas- 
ures, than C. The results suggest that 
the facilitating effect of preexposure = 
visually presented forms found in the 
Cornell studies derived predominantly 
from the nondifferential reinforcement 
Ss received in the presence of those 
forms. That the results of those studies 
are explicable in terms of rinforue E 
theory tends to weaken. the empirica 
Support for the differentiation fe. 
Furthermore, Spence's (1936) specific 
hypothesis concerning the effects of non- 
differential reinforcement received sup- 
port from these findings- 


2-IHr. Groups 


While the E2'-C differences mi pi 
theasures were in ne with enueamen 


ficantly better performance. 
icantly. F nple 
E2 still may have differed significantly from E1. 


ion of uncontrollable reinforcement contingencies in the presence of the stimuli in 
f the E1-C difference could not be made within the enrichment 


Sign = indicates performance levels did not differ 
For example, E2 — C — E1 means E2 did not 


predictions, if food reinforcement were 
the only factor accounting for learning 
enhancement on the discrimination task, 
then Group E2' would have been ex- 
pected to perform better than Group E1’. 
The values of the performance means 
were in this direction, but the differences 
were not significant. These results are at 
least as much in line with differentiation 
as with enrichment predictions. If, in 
the differentiation framework, 2 hr. per 
day were not enough time for one group 
to differentiate the stimulus elements, 
then in neither group would facilitation 
of learning be expected. On the other 
hand, differentiation theory cannot ex- 
plain why E2’ performed better than C 
on two measures while E1’ did not, since 
both 2-hr. groups had the same amount 
of experience with the stimuli. 


Novelty and Stimulus Change 


The apparent inability of either posi- 
tion to account for the results of the 2-hr. 
groups can be resolved by introducing an 
additional explanatory concept, viz. the 
(nondifferentially) reinforcing effects of 
novelty and stimulus change. This con- 
cept postulates, consistent with recent 
work (Berlyne, 1960; Butler & Harlow, 
1954; Dember, 1960; Forgus, 1958), 
that not only can food reinforcement 
account for the raising of excitatory 
tendency, but also reinforcement due to 
novelty and perceptual change can act 
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similarly. Applying this concept, the 
cage forms in Group E1’, appearing as 
they did for only two 1-hr. periods daily, 
were novel stimuli. It may be postulated 
that Ss were attracted to them, in that 
their appearance satisfied the explora- 
tory, change-seeking drive (Dember, 
1960). Asa result, the performance level 
of Group E1’ was brought up to a level 
wherein significant differences between 
the 2-hr. groups were eradicated. 

If perceptual change is postulated to 
have reinforcing properties, the question 
of why E1’ did not perform significantly 
better than C (as did E2' on two meas- 
ures) must be approached. It can only 
be assumed that the effects of this kind 
of reinforcement are quantitatively not 
as great as the corresponding mechan- 
isms for hunger. The “novelty reinforce- 
ment" group (E1’), while performing 
well enough to erase significant differ- 
ences between it and E2’, did not perform 
well enough to differ significantly from C. 
This same assumption about the relative 
strength of novelty and food reinforce- 
ment holds for the failure to find signifi- 
cant differences between Groups E2 and 
E1’ as well (two-tailed ¢ tests between 
these groups on all measures were not 
significant). 

Future research might test the validity 
of the novelty hypothesis by varying the 
amount of daily form  preexposure, 
divorced from food reinforcement. It 
would be hypothesized that the longer a 
stimulus is viewed, the less it will 
continue to reinforce the perceptual 
change drive. Two extremes of such a 
situation are the groups in the present 
study in which the stimuli were present 
in the absence of food. Group E1’, in 
which the stimuli were present 2 hr. 
daily; performed better than Group E1, 
in which the stimuli were present 22 hr. 
Two-tailed £ tests on all four measures 
were significant (p< 025), thus sup- 
porting the novelty hypothesis. 
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DIFFERENTIAL VISUAL FEEDBACK 
OF COMPONENT MOTIONS 
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New closed-circuit television techniq 


ues allowed the joint action of S's 


hand, control-instrument, and operational effects to be visually fed 


back singly or in combination. 


8 visual feedback conditions and 2 


levels of task precision were used. The results showed that the effects 
of visual feedback were determined by the component motions fed back, 


with the operational effects 
control-instrument and hand-arm mo 
between visual feedback 
feedback of the operational componer 


being the most important followed by 
vements. A significant interaction 
and precision of movement occurred, where 


at became more important as more 


overall precision of movement was demanded. 


One way of considering behavior is 
in terms of sensory feedback, and a 
proper experiment within this frame- 
work is to assess performance while 
controlling such feedback. This re- 
search was concerned with the rela- 
tionships of vision and aided motion 
and studied the effects of visual feed- 
back of the different component 
motions involved in a complex motor 
control task. In all aided motion 
there are three main movement com- 
ponents: differentiated body move- 
ments such as those of the fingers, 
arm, hand, foot, or torso; movement 
of the tool, instrument or control; and 
movement of the objects or material 
being controlled or changed. | The 
sensory feedback resulting in all 
modalities from these three noninde- 
pendent component movements may 
be designated as reactive, instrumental, 
and operational feedback, respectively 
(Gould, 1964; Smith, 1962). 

Previous studies have shown that 
modifications in one or more sense 
modality of any one of the three main 
feedback mechanisms may indee 
affect behavior. In the tracking 
literature, for example, evaluation of 
of H. William 
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kinesthetic cues from various types of 
controls or instrumental effects while 
keeping other feedback variables con- 
stant showed significant differences 
(Chernikoff, Birmingham, & Taylor, 
1955; Howland & Noble, 1953). 
Likewise, the differences between 
pursuit and compensatory tracking 
(Briggs, Fitts, & Bahrick, 1957) were 
due to visual operational feedback 
effects, while directionality of move- 
ment relative to the operator for a 
given control changed the effects of 
reactive feedback (Grether, 1947). 

The present study was concerned 
with modification of the visual feed- 
back of the three component motions. 
To accomplish this, a closed-circuit 
television system was designed which 
allowed real-time visual feedback of 
any one of the three component 
motions in a tool-using task, singly 
or in various combinations, where Ss 
used a pliers to move a set of pins into 
prescribed holes. This technique 
allowed vision, for example, of the 
pliers without the pins or hand of the 
S being seen. 

Two hypotheses were tested. The 
relative effects of vision are dependent 
upon the precision of the overall 
motion pattern involved; i.e., the 
more precise the overall motion pat- 
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tern the more behavior should be 
affected by reducing visual feedback. 
This hypothesis was tested by varying 
the size of “target” holes to which the 
pins had to be moved. Secondly, for 
a given overall motion pattern lack of 
vision of the most complex of the 
three motion components should affect 
behavior the most, while lack of vision 
of the least complex will affect be- 
havior the least. Complexity is some- 
what loosely defined as one extreme 
of a general dimension underlying 
motion, namely the degree of invari- 
ance or intrinsic organization asso- 
ciated with or demanded by a given 
movement component. Where little 
or no freedom of variation is tolerated 
the movement component is said to 
be complex; where much variation in 
the execution of a component move- 
ment may occur it is said to be 
relatively simple. Difficulty of move- 
ment or degree of organization may 
well be just as appropriate terms. In 
this study the experimental task was 
designed to make the operational 
component motion the most complex, 
followed by instrumental and reactive 
components. 


METHOD 


'The experimental task consisted of suc- 
cessively transporting plastic steel-shafted 
pins, using a pair of thin long nose pliers, from 
one set of holes to another. The pins were 
arranged upright in a vertical line at the left 
edge of the 113 X 22 in. black workboard, 
which was inclined 45° and contained three 
3. and three §-in. "target" holes, correspond- 
ing to the two levels of precision of overall 
movement. An S grasped the specially 
grooved pins with the pliers and plunged them 
into a 4in. layer of soft sponge rubber 
beneath the six target holes in a specified 
order, doing all of one size in succession. Tha 
rubber pad was changed at regular intervals 

eterioration. d 
x Te main components of motion iz 
this task, then, were an S s hand and RS "i 
pliers, and the pins which Parre ra e ; 
respectively, to reactive, instrumenta i a 
operational effects. To provide feedback o 
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only one or combinations of these movements, 
a high-resolution closed-circuit television 
system was used. The Ss did not look 


rectly at the workboard, which was to their 
right and separated from them by a cloth 
screen, but instead viewed their performance 
on a television monitor (Conrac Model CN 
A8/C). A television camera (General Electric 
Model 4TE15B2) with 1-in. lens was mounted 
in front of and perpendicular to the work- 
board. By turning down the brightness and 
beam controls of the television system and 
elevating contrast and target abnormally the 
picture on the monitor was such that only 
white objects could be seen by .S while black 
stimuli were not differentiated from the back- 
ground. Accordingly, depending upon the 
feedback condition, Ss wore a black or white 
glove and used a black or white pliers to move 
a set of black or white pins into the six white 
target holes. 

Seven feedback conditions and a control 
were used, as defined by what S saw: hand- 
pliers-pins or in the terminology of this paper 
reactive-instrumental-operational (RIO), re- 
active-instrumental (RI), instrumental-opera- 
tional (10), reactive-operational (RO), re- 
active only (R), instrumental only (I), 
operational only (O), and a control (C) that 
wore no glove and viewed a normal television 
picture. In all cases the target holes were 
white and thus always seen on the monitor, 
as was a $-in. vertical white paper strip placed 
adjacent to the column of pins to indicate 
their position. Accordingly the picture on the 
television screen that S saw was entirely black 
except for the white target holes and white 
strip, plus the additional feedback provided 
in a particular experimental condition, (For 
example, in the IO condition, in addition to 
the holes and indicating strip, the pliers were 
seen as moving toward the visible pins, al- 
though S's hand was invisible.) In the condi- 
tions where the pliers and/or pins were black 
and consequently invisible against the black 
background of the work board, a small seg- 
ment of their outline could be detected when 
passed over the white strip and target holes. 

Six female Ss, paid volunteers from a 
nearby college, performed four trials at each 
of the eight conditions and two levels of 
precision or target hole size daily over a 
period of 10 days. A trial consisted of putting 
three pins into three holes, Order of feedback 
and order of hole size were randomized be- 
tween and within Ss, although each S received 
the same order of feedback on each of the 10 
days. Time to completion was the dependent 
variable and was recorded on two clocks, one 
for each size of target hole, accurate to .001 
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min. Thus five variables—feedback, hole 
size, days, trials within days, and Ss—were 
represented in the analysis of variance design, 
and while all 3,840 scores were analyzed the 
results for practice are presented in terms of 
the days variable only. 


RESULTS 


By providing visual feedback of 
only certain components of motion 
with the remaining component(s) not 
fed back performance was differ- 
entially affected (p < .001) depending 
upon the movement pattern that .Ss 
Saw. Figure 1, where average time to 
completion of a trial in each feedback 
condition is plotted against days of 
practice, shows that while some learn- 
ing occurred at all conditions there 
Was a general trend toward more 
learning at the more difficult feedback 
conditions (Days X Feedback, p «.01). 
The results of RIO, IO, and Control 
feedback had very similar effects upon 
performance and are thus shown as a 
single curve in Fig. 1. A summary of 
the analysis of variance of the time 
Scores on which the above two sta- 
tistical conclusions as well as subse- 
quent statistical results are based is 
Shown in Table 1. 
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TABLE 1 


ANALYSIS OF VARIANCE OF TIME Scorers 
(THOUSANDS or A MINUTE) 


Source df MS F 
Feedback (F) 7 2.4807 | 49.71*** 
Hole size (H) 1 2.8534 | 37.94*** 
Days (D) 9 .6261 6.56*** 
Ss | 5 4258 
FXH | i -2100 | 10.19*** 
FXD 63 .0579 1.88** 
F X Ss 35 -0499 
H XD 9 | (0461 | 3.14** 
H X Ss 5 .0752 
D X Ss 45 .0955 
FXH 63 .0151 1.22 
FXH 35 .0206 
FED aoe | fe 
H XD | 48 | 0147 
FXH 

X. 315 .0124 

l'otal 959 
**p <.01. 
wep e .001. 


By analyzing the differences among 
the overall means of the eight feed- 
back conditions it was found that 
those conditions where no operational 
feedback was provided were signifi- 
cantly more difficult than those in 
which Ss saw the pins. This was true 
both on the first day of practice and 
on the last day also. In addition, 
instrumental feedback was more im- 
portant in this task than reactive 
feedback. Specifically, using range 
tests (Duncan, 1955) there were three 
groups of means which differed sig- 
nificantly from each other at the .01 
level of confidence: (a) Control (.165 
min., RIO (.162), IO (.169), RO 
(183), and O (188); (b) I (255) and 
RI (.257); and (c) R (.369). Means 
within any one group were not sig- 
nificantly different. Thus the effects 
of feedback fall into three classes: (a) 
those four experimental conditions 
providing operational feedback, re- 
gardless of additional feedback (O+); 
(b) those two conditions providing 
instrumental feedback but no opera- 
tional feedback (OI--); and (c) the 
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single condition providing neither in- 
strumental nor operational feedback 
(OD. 

It should be noted that when all the 
raw-time scores were converted to log 
values and then analyzed the resulting 
curves and statistical conclusions were 
essentially the same as presented here. 

As shown in Table 1, the overall 
data were also analyzed to determine 
the interaction between feedback and 
the overall precision of movement, 
as defined by the two sizes of target 
holes. Performance was naturally 
better (p< .001) on the large holes 
than the small holes. In addition, 
with practice performance became 
increasingly better on the large holes 
relative to the small holes (Size 

X Days, p < .01). Figure 2, where 
average time to completion on each 
hole size is plotted against the eight 
feedback conditions and summed over 
all scores, shows that the differences 
between placement of the pins in the 
large and small holes became increas- 
ingly greater as feedback became less 
helpful. That is, a significant inter- 
action between feedback and size of 
hole (p < .001) occurred, where dif- 
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of feedback. 
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ferences in performance between hole 
size became greater when no opera- 
tional feedback was provided. The 
triple interaction of Feedback X Size 
X Days was not significant. 

Duncan range tests were run on the 
16 means of Fig. 2, as well as the 
corresponding means of Day 10 alone. 
On the overall data the 16 means fell 
into four groups, each group differing 
from each other at the .01 level of 
confidence. Letting lower-case char- 
acters represent performance on the 
small holes and upper-case represent 
performance on the large holes, the 
groups of means from worst to best 
were: {r}; {R, ri, i, I, RI}; (i, I, RI, 
ro, 0) ; (I, RI, ro, o, io, O, c, rio, RO, 
C, IO, RIOJ. Means within a par- 
ticular group were not significantly 
different. The range analysis of Day 
10 showed two groups of means differ- 
ing at the .01 level: (r, R, ri, i) and 
{R, i, ri, io, o, RI, I, c, ro, rio, O, C, 
RO, IO, RIO}. Overall, just the 
feedback condition where S saw only 
his hand (R) differed significantly as 
a function of hole size, while Cond. 
RI vs. ri and I vs. i just missed the 5% 
confidence level. On Day 10 no 
significant differences occurred within 
any feedback condition as a function 
of hole size. Considering performance 
on the two hole sizes as a function of 
feedback, the main finding was that 
overall time scores on the large holes 
did not differ for the seven easiest 
feedback conditions while perform- 
ance on the small target holes did not 
differ significantly for the easiest five 
feedback conditions (Fig. 2). On 
Day 10 performance under R feedback 
on the small holes differed significantly 
from the other 15 conditions. No 
significant differences were observed 
within this latter group. By pooling 
the overall data to form six groups 
corresponding to performance on the 
two hole sizes for the three signifi- 
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cantly different feedback groups de- 
termined above (O+, OI+, OI) only 
the OI feedback condition differed 
significantly as a function of hole size. 

At the end of the 10-day experi- 
ment Ss attempted to fill the holes 
under two additional conditions: when 
they could not see their hand, pliers, 
or pins; and when, in addition, they 
could not see the holes or the thin 
white strip indicating the location of 
the pins also. These conditions were 
not included in the original eight be- 
cause of their difficulty and they were 
run at the end of the experiment to 
determine the degree to which Ss were 
Still relying upon vision in this well- 
practiced task. In both cases per- 
formance was significantly poorer 
than in the main study, the completely 
blind condition being nearly im- 
possible. 

DisCUSSION 

The main results of this study, using a 
television system to control visual feed- 
back of component movements, indicated 
Derformance was dependent both upon 
the components that were fed back and 
also the nature of the task itself. That 
is, the more precision demanded in the 
Overall motion pattern, the more !mpor- 
tant was visual feedback, a fact confirm- 
ing one of the predicted hypotheses. 

ithin each level of precision the same 
relative order of importance of visual 
feedback was observed, with operational 
feedback affecting performance the most 
and reactive feedback being of least 
Significance. In addition, an interaction 
occurred between feedback and level of 
Precision, where operational feedback be- 
came even more important as the re- 
quired overall movement became more 
Precise. The rate of learning was shown 
to be a general derivative function of 
visual feedback. h 

The findings of this study provide 
evidence of the interdependency of visual 
Perception and motion, and indicate that 


267 


the effects of visual feedback of com- 
ponent movements are not linear, in- 
dependent, or additive. Rather, for the 
task studied here, performance was 
determined specifically by the com- 
ponent motion fed back in the order 
operational-instrumental-reactive, and 
within this order the addition of lower- 
order feedback did not change perform- 
ance appreciably. That is, operational 
feedback affected behavior the most and 
in the four conditions where operational 
feedback was provided any additional 
visual feedback caused no significant 
improvement in performance. The same 
conclusion is true for instrumental feed- 
back, since no differences were found in 
the two conditions providing instru- 
mental feedback but no operational 
feedback. Finally, the single condition 
providing only reactive feedback was 
most deleterious to behavior. 

The results of this study are inter- 
preted to support the hypothesis that 
the effects of visual feedback are depend- 
ent upon the complexity of the motion 
component involved, since the effects of 
feedback followed the same order as did 
the a priori designed complexity of com- 
ponent movements. Naturally a criti- 
cism of this interpretation may be raised. 
Although the experimental task was 
designed to make the operational com- 
ponent movement the most complex, it 
could be argued that this hierarchy is 
always the case. That this is not an 
entirely valid criticism, however, was 
shown in a study reported in Smith and 
Smith (1962) on simple tracing of an 
outline. Using three feedback conditions 
corresponding to RIO, RI, and O in the 
terminology of this paper no difference 
in overall performance was found. 

A second criticism, related to the first, 
is that what has been called “complexity” 
is not necessarily either an apt word or 
descriptive of the main underlying 
dimension of motion. One alternative 
would be the degree to which each com- 
ponent motion is related to the comple- 
tion of the task. That is, operational 
movement is the final step in the task 
and therefore the most critical. But this 
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is just another way of stating a fixed 
hierarchy in the order R-I-O, and this 
hierarchy has been shown to be invalid. 
On the other hand, the interpretation 
given in Smith and Smith (1962) to their 
many studies indicates that complexity, 
precision, or intrinsic organization is 
indeed the underlying dimension of 
motion. 

A third criticism of this study is that 
hole size may be directly related only to 
the operational component motion and 
thus this independent variable did not 
vary the overall nature of the task as it 
was designed to do. If this were the case 
then it would be expected that the feed- 
back of operational movement would be 
mainly affected by hole size. Toa minor 
extent this is true. Based upon overall 
data, while hole size affected all feedback 
conditions the differences were larger for 
the three conditions where operational 
effects were not fed back yet this differ- 
ence was significant in one condition (R) 
only. To answer fully these questions 
more research is needed wherein the 
complexity of each of the component 
motions is varied independently within 
the same task for several different tasks 
of unequal difficulty. 
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TRAINING AND TIMING 


IN THE GENERALIZATION 


OF A VOLUNTARY RESPONSE! 
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Generalization of 


of training between 0 and 30 trials. 


responding. In view of these find 


in such gener: 
habit mechanism. 
response delivery have the effect 


a voluntary response shows no increase with amounts 


It is associated with short-latency 


ings, the process of response selection 


alization cannot easily be explained solely in terms of a 
It is suggested that factors speeding or slowing 


of altering S's selection among tem- 


porally stacked response tendencies. 


Under some circumstances, a hu- 
man S instructed to give a response 
to a specified stimulus may show a 
graded tendency to deliver that re- 
Sponse to other, similar stimuli in 
proportion to their similarity. Such 
generalization of a voluntary response 
was first described in a paper by 
Gibson (1939). It has been actively 
investigated during the last dozen 
Years, after a paper by Brown, Bilo- 
deau, and Baron (1951) introduced a 
widely used reaction-time technique 
for the study of such generalization. 

'The generalization of a voluntary 
response differs in several respects 
from the generalization of a clas- 
Sically or instrumentally conditioned 
response. While these differences 
have been commented upon by the 
above-mentioned authors, and by 
Mednick and Freedman (1960), the 
interrelations among the kinds of 
generalization found in the various 
paradigms are not completely clear. 

. Generalization of a classically or 
instrumentally conditioned response 
can only occur after the conditioning 
itself, after a series of training trials 
which produce and strengthen an 
rant from the 
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association between training stimulus 
and response. One could, considering 
these paradigms, attempt to explain 
generalization by saying that phys- 
ically similar stimuli partake of the 
habit of responding attached to the 
training stimulus, as Hull (1943) did 
in equations establishing generaliza- 
tion as a manifestation of generalized 
habit strength. Generalization would, 
in such a scheme, be expected to in- 
crease with training. In the case of 
instrumental conditioning, this ap- 
pears to be true (Margolius, 1955). 
In the case of classical conditioning, 
it is at least partially true. Hovland 
(1937) concluded that the generalized 
GSR response increases with moderate 
amounts of training, decreasing some- 
what with still further training. 

Does the generalization of a volun- 
tary response similarly increase with 
training? It is true that Es usually 
administer practice trials before test- 
ing for such generalization, but in 
those practice trials Ss simply re- 
hearse what they were told to do and 
what they did quite quickly and 
efficiently on their very first trial. It 
seemed worthwhile to ask whether 
training augments the generalization 
of a voluntary response, and accord- 
ingly the present studies examined the 
effects of varying amounts of practice 
upon such generalization. 
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Another salient characteristic of the 
reaction-time method of studying 
voluntary-response generalization is 
an apparent dependence upon in- 
structions emphasizing speed.  Ac- 
cordingly, certain steps were taken in 
the present studies to provide for care- 
ful examination of response latencies. 


METHOD 


The procedure used in these studies was a 
version of the reaction-time technique, with 
modifications made with an eye to the taking 
of latency measures. The S was asked to 
make one response to the training stimulus 
and another to any other stimulus, so that a 
latency measure might be obtained when an S 
made either a generalized or a nongeneralized 
response to a test stimulus. The usual re- 
action-time procedure offers S a respond-do 
not respond choice, and the latency of a 
response to a test stimulus can only be taken 
when it is a generalized response. Brown et 
al. (1951) reported that they had tried a two- 
response procedure like that used here, 
obtaining amounts of generalization com- 
parable to those obtained in their more 
usually adopted one-response procedure. 

The stimuli were a series of colored lights 
presented through one window, rather than 
the usual spatial array of lights. Reaction 
time lengthens to visual stimulation as it 
becomes peripheral, and this fact was felt to 
introduce an unnecessary complication into 
interpretation of spatial generalization data. 

White noise was continuously piped to S 
after instructions were given, to Partially 
mask out stray environmental noises. The 
family of reaction-time procedures is quite 
sensitive to auditory distractions, and it was 
hoped that the partial stabilization of audi- 
tory stimulation induced by the noise would 
improve the reliability of the reaction-time 
measures. (Comparisons of noise, no noise 
with some trial Ss suggested that the noise 
level used neither speeded nor slowed simple 
reaction time.) 

Two studies are to be reported. Experi- 
ment I compared two groups of 20 Ss each, 
one group given 5 and the other group 20 
trials of training before generalization tests. 
Experiment II compared three groups of 20 Ss 
each, one group given 0 training, another 
given 30 trials of training, and the third given 
30 trials of “control training” with a light 
quite dissimilar to the test colors. 
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Subjects 


The Ss of Exp. I were sophomore to senior 
undergraduates participating in the study as 
part of a first course in psychology. The Ss 
of the second study were paid freshmen and 
sophomores recruited from biology classes; 
these Ss had had no psychology courses. 


Apparatus 


The apparatus was housed in two stacked 
21 X 15 X 11 in. cabinets. The front face 
which was presented to S was 22 in. tall and 
21 in. wide. Centered 34 in. from the top 
was a l-in. white ready light. Centered 4 in. 
below the ready light was a 33-in. diameter 
circular window of flashed opal glass, through 
Which the stimuli were presented. Atbottom, 
centered, was a metal response panel 23 in. 
wide and 4 in. long. The S could rest the 
heel of his hand upon the table and manipulate 
this lever with his fingers. 

In the upper cabinet, behind the stimulus 
window, was a semicircular array of six 75-w. 
projector bulbs, each connected to its own 
autotransformer to allow adjustments in light 
intensity. A hue series was arranged by 
placing Corning molded glass filters in front 
of five of the bulbs. The hues, and filters, 
were: (a) red, No. 2408; (b) reddish-orange, 
No. 2424 and 2434; (c) orange, No. 3307 and 
3480; (d) yellowish-orange, No. 3486 and 
3482; and (e) an orange-yellow, No. 3484. 
In these studies, Hue a was always the 
training hue, and the others were the test 
hues. The series wa, comparable to one used 
in a different generalization procedure (White 
& Spiker, 1960), except that here subjective 
brightnesses of the stimuli were roughly 
equalized. 

Five preliminary Ss were used as Os in 
method of limits comparisons where the 
brightness of Sı, set constant, was compared 
with varying brightness of each of the other 
hues. Ten estimates of a point of subjective 
equality were obtained from each O for each 
hue, and all estimates were averaged to 
determine a final autotransformer setting for 
each bulb. Reduction of current to a bulb 
can cause it to reach full brightness percep- 
tibly more slowly when the bulb is turned on. 
Examination of the series of hues after the 
psychophysical adjustments indicated that 
the colored lights were not discriminable by 
differing "on-times." Both preliminary and ex- 
perimental work were done ina darkened room. 

The lower cabinet contained the white 
noise generator, and circuitry for a standard 
trial sequence: (a) with the pressing of a 
start button, a preset ready interval of from 
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à to 4 sec. was initiated and the ready light 
came on; (5) at the end of the ready interval, 
the stimulus light came on, and a Standard 
Electric clock started; (c) S's first response, 
up or down, stopped the clock, turned off the 
stimulus and ready lights, and registered on 
indicator lights in back. The trial would not 
begin unless S was holding his lever in the 
middle position. A few Ss found it difficult 
to consistently feel this middle position. To 
help in such cases, a tiny indicator light was 
mounted at the top of the display, which lit 
Whenever the response lever was in the middle 
Position, After each trial, E recorded the 
time of response and whether the response 
had been an up or down hand movement. 


Procedure 


In Exp. I, all Ss were initially instructed to 
lift their hand when they saw Si, practiced 
this three times, and were then further in- 
Structed to press down on the lever whenever 
a different color appeared. Speed was re- 
Peatedly urged. There followed 5 trials of 
Practice on S, for Group 5, or 20 trials for 

roup 20, followed without pause by a test 
Sequence of 12 test trials interspersed among 

Presentations of Sı. Within the test trials, 
the four test stimuli were each given three 
times in a standard counterbalanced, three- 
Cycle arrangement. 

, In Exp. II, Ss in Group 30 were first 
instructed to respond to Si by lifting their 
and as rapidly as possible, and did this for 
trials, The Ss in Group C-30 had equiv- 
alent practice with an unfiltered light. Fol- 
lowing this practice period, the two groups 
Teceived instructions identical to those o 
TOup 0, That is, they were shown Si and 
told to lift their hand whenever seeing this 
Color, but to press down whenever any other 
Color appeared, Again, speed was urged. 
test sequence like that of Exp. I was then 
Completed, 


RESULTS 

Generalized Responses 

During the test sequence, lifting the 
land to S, was scored as correct, 
lifting it to a test stimulus was scored 
aS a generalization error. Table 1 
shows the percentages of such re- 
SPonses given by all groups. There 
Was little difference in total generaliza- 
üon among groups of either study ; if 
anything, increase of practice trials 
Slightly “decreased the incidence of 


TABLE 1 
PERCENT VOLUNTARY GE ALIZATION 
ERRORS AS A FUNCTION OF TRIALS 
OF TRAINING 


Errors to Test Stimuli 


Group (Corset 
S: S; Sı Ss | S:-Ss 
5 (96) 83 | 52 | 42 | 39 | 49 
20 (98) 70 | 56 | 41 | 41 | 47 
0 (92) 72 | 50 | 28 | 27 | 44 
30 (93) 52 | 28 | 30 | 25 | 34 
C-30 (96) 68 | 33 | 15 | 18 | 34 


Note.—N = 20 in each group. 


generalized responses. The Ss of 
Exp. I generalized somewhat more 
than those of Exp. Il, a fact which 
will be discussed below. 

Analyses of variance testing the 
effects of Groups and Stimuli on 
number of generalized responses dur- 
ing the test period were done for Exp. 
I and II separately, and for the two 
experiments combined. The three 
analyses were alike in showing highly 
significant effects for Stimuli, with 
effects of Groups and Groups X Stim- 
uli nowhere significant. 


Latency of Response 

Table 2 gives the geometric mean 
latencies of responses during the test 
sequence for each of the five groups. 
For each S, every response latency 
was. converted to log form, and four 
means were obtained—means for cor- 
rect responses, and for errors, to the 
test stimuli, and means for correct 
responses, and errors, to the 16 
presentations of S, interspersed among 
the test trials. Table 2 shows the 
mean of those means for each group, 
reconverted into seconds because it 
was felt the data would be more com- 
prehensible in the common metric. 
'The indicated statistical analyses 
were, of course, done with the data in 
log form. 

It is evident from the left half of 
Table 2 that an error response on a 


N 
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TABLE 2 


GEOMETRIC MEAN LATEN 
DURING 


S OF 
THE TEST SEQUENCE 


CORRECT RESPONSES AND ERRORS 


Test Stimuli 


Training Stimulus | 


| 
G EN Training 
row | ae — ea | d 
| Correct Error |  Corect | Error Error Error 
5 | ag | ost | o 01 ns 
20 0.63 0.49 0.5. ‘001 ns 
0 0.73 0.66 0.61 | 0 01 
30 0.74 0.67 0.63 | 105 ns 
C-30 0.74 0.66 0.61 | | o ns 
All Ss 0.69 0.59 0.58 | 001 .02 
af test. nid 


test trial tended to be of significantly 
shorter latency than a correct re- 
sponse. The absolute latency differ- 
ence was notlarge. It tended to be of 
about the same magnitude from the 
beginning to the end of the test 
sequence. 

The fact that errors to the test 
stimuli were of shorter latency than 
correct responses might argue that 
impulsiveness of response helped cause 
sucherrors. Another possibility which 
must be dealt with, however, is that 
the error response—uniformly “up” 
for all Ss—was physically easier, and 
thus quicker, than the correct response 
—uniformly “down” for all Ss. Sev- 
eral findings argue against such an in- 
terpretation of the latency differences. 

First, in these studies a significant 
tendency to short-latency errors was 
found during interspersed trials with 
Sı, even though opposite hand motions 
defined correct responses and errors 
on such trials. The right half of 
Table 2 shows this. The effect is not 
so regular here as on the other side of 
the table, but error incidence was very 
low on this type of trial, averaging 0.8 
errors per S over 16 trials and esti- 
mates of error iid were accord- 
ingly much less reliable. 

Ns in other research (White & 
Grim, 1962) a group of 20 Ss was 


included in a procedure comparable to 
that of the present studies, except 
that concurrent GSR recordings were 
taken, and that the instructions called 
for reverse hand motions to those used 
here. For this group, error latencies 
to the test stimuli (0.63 sec.) were 
again shorter than correct response 
latencies (0.67). This latency differ- 
ence was consistent with those found 
in these studies, though not sta- 
tistically significant. 

Third, the analysis to be discussed 
in the following section showed a 
further relationship between quick 
responding and errors which appeared 
difficult to construe as the product of 
a movement artifact. 


Subject Speed and. Errors 


As noted above, Ss of the second 
study generalized less than those of 
the first study. Since the two studies 
were quite comparable in most re- 
spects, about all that could reasonably 
be seen as a probable cause of the 
difference was an atmosphere effect. 
The Ss of the first study, recruited in 
a psychology class with a lab, seemed 
reasonably relaxed when participating 
in the study. The Ss of the second 
study had almost no acquaintance 
with psychology and were visibly 
more diffident. 


TRAINING AND VOLUNTARY GENERALIZATION 


Each S's mean log latency on his 16 
Sı trials during the testing sequence 
was plotted against the number of 
errors he had made during the test 
trials. Figure 1 shows this compari- 
Son; in that figure, six classes of Ss 
with comparable mean latencies are 
grouped. The decline in errors on the 
test trials with increasing latency on 
the S, trials was highly significant, 
with F (5, 94) = 6.47, p < .001. The 
Square dots in the figure indicate mean 
values for all Ss in Exp. I and 11; the 
differences in generalization in the two 
studies could reasonably be linked to 
the relationship pictured in the figure. 
Apparently, the atmosphere effect 
influenced generalization by affecting 
the speed at which Ss were led to pace 
themselves during the task. 


Latency Gradients 

Latency gradients of voluntary 
generalization have been a subject of 
some interest and, accordingly, those 
obtained in these studies are pictured 
in Fig, 2, Analyses of the data 
represented in the figure did not show 
significance for the effects of Stimuli or 
Stimuli X Correctness; however, such 
Statistical analysis was compromised 
by the fact that few Ss gave both a 
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B] Experiment A 


Experiment B. El 


MEAN NUMBER ERRORS, TEST STIMULI 
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correct response and an error to all 
four test stimuli. Corresponding 
curves for each of the five groups were 
grossly comparable to their average, 
pictured here. 


DISCUSSION 


If the generalization of a voluntary 
response does not increase with training, 
and if it is associated with short-latency 
responding, then an explanation of the 
phenomenon would apparently require 
some mechanism in addition to kabit to 
fully describe the process of elicitation 
of generalized or nongeneralized re- 
sponses by test stimuli. Of course, the 
fact that verbal instructions can call 
forth intended behavior must rest on 
habit structures, on S's previous associa- 
tions between words and cues, and words 
and acts. Nevertheless, as the term 
habit is ordinarily understood, a habitual 
association between a cue and an act is 
not instantly installed by instructions; it 
strengthens with practice; it does not 
depend upon an appropriate response 
latency to take effect. 

Practice has been shown to increase the 
generalization of one sort of voluntary 
response (Spiker, 1956a, 1956b). In 
Spiker’s procedure, the response measure 
was a count of the number of handle pulls 
within a 3- or 4-sec. interval. The speed 
and rhythm of such bursts of responding 
may be increased by practice and thus, 
secondarily, practice could augment re- 
corded responsiveness during general- 
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ization tests. However, in a reaction- 
time procedure the learning or smoothing 
of response-response sequences is prob- 
ably of minimal importance, and thus 
training has no apparent effect. 

The influence of short- or long-latency 
responding upon the generalization of a 
voluntary response suggests that the phe- 
nomenon is associated with temporal 
response priorities. In fact, Fig. 3 
exactly illustrates a temporal stacking of 
the two competing responses in the gen- 
eralization test trials of the present 
studies. Perhaps something like the fol- 
lowing is involved in the process of re- 
sponse selection in the present procedure: 

With the first presentation of S, to 
him, S stores an approximate model of 
the stimulus—this is, of course, selí- 
evident in that Ss can recognize some- 
thing they have only seen once. Similarly, 
S stores two response sets as a result of 
instructions—R, if Sı; Re if anything 
else. It seems plausible that the instruc- 
tions establish the responses not as 
simultaneously competing, but in a 
priority order—''R,—and if not, Re.” 

Two delays of response are apparently 
involved in the subsequent use of these 
stored specifications, what might be 
called a general and a contingent delay. 
'The majority of responses during testing 


100% 
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100 % 
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TIME IN SECONDS 
Fic. 3. Proportions of correct or incorrect 
responses delivered at successive (logarith- 
mically equal) intervals after stimulus omer 
(Pictured time-space includes almost all ol 
1,200 test-trial responses.) 
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—correct responses to S;, and errors to 
the test stimuli—were delivered at a 
mean time of .58 sec. In the study, these 
were all “up'’—or, in the terminology 
adopted for discussion, R:—responses. 
This mean time for the first-available 
response is considerably longer than the 
usual simple RT, and consequently it 
seems reasonable to suppose that S is 
operating under a general delay. Such 
a delay, common to all disjunctive RT 
procedures, presumably gives time for 
matching of the perceived stimulus 
against the stored model. 

If a mismatch is detected, then a 
second, contingent delay appears to be 
necessary for delivery of Re. Perhaps it 
is associated with the sort of novelty 
reaction shown in Grings' (1960) per- 
ceptual-disparity response, or the orient- 
ing reflex (Sokolov, 1960). The contin- 
gent delay is quite brief, and may simply 
be a consequence of the fact that response 
must be inhibited until Rs is available. 

One must assume that a response can 
be launched before these delays complete 
themselves, and a response so launched 
will reflect whatever response is available 
atthatinstant. Any long-term condition 
(such as a speed set) or short-term condi- 
tion (such as startle) which forces out a re- 
sponse before a contingent delay will tend 
to induce generalization on a test trial. 

Occasionally, a response may be forced 
out before even general delay is com- 
pleted—in which case, response selection 
would tend to be random. This, it is 
assumed, is the genesis of the few short- 
latency errors to Sı which occurred; such 
responses were the very fastest responses 
given in the testing sequence. 

The above explanation of the response- 
selection process in the generalization of 
a voluntary response is necessarily 
sketchy. It deals with choice of response 
to the generic test stimulus, leaving aside 
important unanswered questions as to 
how similarity between training and test 
stimuli influences the incidence of errors. 
It does, however, offer some grounds for 
understanding of S’s choice between 
speed or accuracy in rational terms and, 
in so doing, it suggests tie-ins among à 
number of significant aspects of the 
generalization literature. 
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It accords with the fact that instruc- 
tions emphasizing speed are so important 
to voluntary generalization (Brown et 
al., 1951), and that impulsive Ss tend 
to be high generalizers (Lacey & Lacey, 
1958). 

It is in line with the fact that associa- 
tive interference errors, logically much 
like generalization errors, are associated 
with short-latency responding (White, 
Spiker, & Holton, 1960). 

It allows a reasonable explanation of 
Mednick's (1955) finding that organic 
patients generalize less than normals. 
While it seems hard to believe that 
heterogeneous brain damage leads. to 
more accurate stimulus differentiation, 
it would be quite in line with what is 
known of organic patients (Goldstein, 
1947) to suggest that they adopt a more 
hesitant, slower speed set on a general- 
ization task. E 

The fact that schizophrenics generalize 
More than normals (Knopf & Fager, 
1959; Mednick, 1955) could under this 
view be linked with ‘‘microgenetic 
Processes. Recently, theory (Flavell & 

raguns, 1957) and rather impressive 
evidence (Werner, 1957) have been 
offered to suggest that schizophrenic 
reactions resemble impulsive reactions 
Usually inhibited by normals. 

Finally, the findings of Thompson 
(1962) appear to be in line with the 
Present sort of reasoning. He has re- 
Ported that removal of auditory cortex 
in cats led to high levels of generalist ao 
apparently by creating a loss of ability 
to form response inhibition. 
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EFFECTS OF STIMULUS CHANGE UPON 


TEE 


GSR AND REACTION TIME? 
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60 Ss were given 16 RT trials using a colored light as stimulus and then, 
without warning, lights differing in color but not in intensity were 
presented. Augmented GSR reactions occurred as a monotonic func- 


tion of amount of change. 


Lengthened RTs were also observed; the 


amount of such lengthening did not appear to be simply related either 
to evoked GSR or to amount of stimulus change. 


When Ss are presented with a series 
of conditioning trials, and neither the 
stimulus nor the reinforcement under- 
goes a discernible change, S appears 
to become habituated to the constant 
characteristics of his repeated ex- 
periences. If a change then occurs in 
either the stimulus or the reinforce- 
ment, a drop in skin resistance occurs; 
this GSR reaction has been referred to 
asa perceptual disparity response. The 
magnitude of the perceptual dis- 
parity response is a function of the 
number and proportion of previous 
experiences which have been consis- 
tent with one another, and of the 
amount and direction of the stimulus 
change involved (Grings, 1960). 

Kimmel (1960), using auditory 
stimuli, studied the effect of changes 
in stimulus intensity upon the percep- 
iual disparity response, and found 
that amount of GSR reaction related 
monotonically to the extent of stim- 
ulus change. The present study 
sought to determine the extent of 
perceptual disparity responses to a 
series of colors for which generaliza- 
tion data had previously been ob- 
tained. The intent of the study was 
to look for possible points of connec- 


1Study supported by a grant from the 
Ford Foundation in Behavioral Sciences and 
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No. M-3639. The principal responsibi ity M 
the interpretations made of these data res 
with the second author. 


tion between the perceptual disparity 
response and generalization. Record- 
ings of reaction time were made, to 
see if the perceptual disparity response 
is accompanied by any change in 
timing of response to a stimulus, since 
response timing appears to be a 
significant influence on the generaliza- 
tion of a voluntary response (White, 
1965). 


METHOD 


Apparatus.—The basic apparatus has been 
described in some detail in White (1965). 
Essentially, it consisted of two stacked cabi- 
nets, the top one containing lights and filters 
for the presentation of a series of illuminated 
colors, the bottom cabinet presenting to Sa 
response lever, and containing switching and 
timing circuitry for a simple RT procedure, 

Continuous GSR recordings were obtained 
using a Fels dermohmmeter, Model 22A, 
connected to a 1-ma. Esterline-Angus pen 
recorder. Subject current was 70 uà. Elec- 
trodes were circular 1j-in. zinc disks, set in a 
g-in. deep Plexiglas cup filled with electrode 
jelly supplied by the instrument manufacturer 
(Yellow Springs Instrument Company, In- 
corporated, Yellow Springs, Ohio). 

Stimuli.—Stimuli were presented through 
a 34-in. diameter circular window of flashed 
opal glass. Behind the window was a semi- 
circular array of five 75-w, projector bulbs, 
cach connected to its own autotransformer 
toallow adjustments in lightintensity. A hue 
series was arranged by placing Corning 
molded glass filters in front of these bulbs. 
The hues, and filters, were: (a) red, No. 2408; 
(b) reddish-orange, No. 2424 and 2434; 
(c) orange, No. 3307 and 3180; (d) yellowish- 
orange, No. 3486 and 3482; and (e) an 
orange-yellow, No. 3484. This series, or à 
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closely similar series, had yielded monotonic 
generalization gradients in a number of previ- 
ous studies (White, 1965; White & Grim, 
1962; White & Spiker, 1960). 

An attempt was made to roughly equate 
subjective brightnesses of the stimuli through 
the following procedure. Five preliminary Ss 
were used as Os in method of limits compari- 
sons where the brightness of Si, set constant, 
was compared with varying brightnesses of 
each of the other hues. Ten estimates of a 
point of subjective equality were obtained 
from each O for each hue, and all estimates 
Were averaged to determine a final autotrans- 
former setting for each bulb. Both prelimin- 
ary and experimental work were done in a 
darkened room. 

Subjects.—Sixty undergraduates at the 

"niversity of Chicago served as 5s. Twenty- 
three were from a psychology course, and 
fulfilled a laboratory requirement by par- 
licipating in the experiment; these Ss were 
given only one "change" trial and. will be 
termed Group A. Thirty-seven Ss were paid 
Volunteers recruited from an undergraduate 
biology course; these Ss were given four 
Successive change trials and will be termed 
Group B. Except for their differing numbers 
of change trials, both groups experienced 
identical procedures. 

Procedure.—Electrodes were attached to 
the Ss 20 min. prior to the experiment, m 
Order to allow time for the electrode jelly to 
hydrate theskin. Both electrodes were placed 
9n the palm of the left hand, one just under 
the. index finger and the other a maxima 
distance away on the heel of the hand. 
Plexiglas spring clamps held the electrodes in 
Position, i " 

After the preliminary period, spent in 2 
Waiting room, S was brought into the expen- 
Mental room and seated before a table qe] 
the reaction-time apparatus. He was S * 
to use his free right hand to press the T p 
While avoiding tensing or moving the en 
hand, which rested on the table. A S 
Written instructions explained the task. T. 
Ss were instructed to maintain the ps 
ever in a middle position until the hia p 
lighted, and then to press down on the lev 
as quickly as possible. —— 

The S e then given 16 habia le? 
trials to § , red stimulus. On the dew 
trial, without pause or warning, one of ks 
Other four stimuli was presented. Fifteen a 
received each one of the four change sumat 
on the seventeenth trial. On Trials 18-20, 
the 23 Ss of Group A returned to Sı again, 
While the 37 Ss of Group B continued to 


receive the change stimulus assigned to them 
on Trial 17. 

On each trial, latency of response was 
recorded on a Standard Electric timer to 
hundredths of a second and, for analytic pur- 
poses, such latencies were later converted to 
logs. A GSR reaction was scored if a drop in 
resistance occurred within 1-3 sec. after 
stimulus presentation ; the resistance readings 
were converted to log conductance change 
measures. 

Consideration of possible GSR artifacts.— 
The procedure and recordings offered the 
possibility of several artifacts in the GSR 
records—due to scale changes, polarization, 
and movement—which, though they were not 
felt to be critical, do require some discussion 
in the context of a description of method. 

The Fels dermohmmeter maintains both 
range and sensitivity by an automatic scale 
change feature, which causes a full-scale 
return sweep of the pen whenever it ap- 
proaches maximum excursion. About 6% of 
the time, an automatic scale change of the 
dermohmmeter occurred immediately after a 
trial, and the consequent full-scale sweep of 
the galvanometer pen made it impossible to 
score a GSR reaction for that trial. This 
introduction of unscorable trials was, un- 
fortunately, not random but tended to obscure 
more large than small GSR reactions—the 
larger reactions being more likely to bring the 
pen to the point of scale change. ‘The effects 
of such an artifact were judged to be es- 
sentially neutral for experimental purposes, 
since it would tend to reduce both treatment 
and error GSR variance. 

Electrode polarization, always a factor to 
be considered in GSR work, probably had 
little influence on the results of this study. 
The actual recording went on for only 10 min. 
or so, too brief a time for much polarization 
to occur. (Occasionally, the two zinc discs 
were touched together just after removal from 
an S, and little or no voltage differences were 
observed on the instrument's galvanometer.) 
Even had more polarization occurred, it is 
dubious whether it could have significantly 
altered a study devoted, as this one, to 
resistance change rather than basal resistance 
data. 

Finally, the fact that one of S’s hands was 
working while the other held the GSR 
electrodes opened the door to the possi ility 
that the GSR records contained a component 
of movement artifacts: that is, that the GSR 
pen might have at times acted as a stabilim- 
eter recorder as well as a resistance recorder. 
It was felt that movement artifacts did not 
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Fic. 1. Mean GSR reaction (mean log conductance change) on each trial of the study. 
(Group A results are pictured in the bottom part of the figure, Group B results above.) 


significantly influence the obtained data be- 
cause; (a) pretesting suggested that the 
electrodes on the left hand were not very 
sensitive to movements of the right hand; 
(b) when movement artifacts were observed, 
they gave a characteristic sharply swinging 
pen record—and such were rarely seen in the 
experimental records; and (c) the movement 
responses which might induce an artifact in 
the study came about .5 sec. after stimulus 
onset, while GSR reactions were scored in the 
interval from 1 to 3 sec. after onset. 


RESULTS 


Perceptual disparity res ponses.— 
Figure 1 pictures mean GSR reactions 
for each trial of the experiment. The 
data for Group A, which received only 
one change trial, are pictured at the 
bottom of the figure, the data for 
Group B, with four change trials, at 
the top. 

GSR responsiveness, high during 
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the first few habituation trials, settled 
down to a low and fairly steady level 
within a few trials. The decline in 
mean GSR reaction from Trial 1 to 
Trial 16 was highly significant, with 
t (48) = 4.60, p < .001. (As with 
some other GSR comparisons to be 
reported, less than the total sample V 
were involved because of unscorable 
GSR trials, as noted above.) — . 
Trial 17, the first change trial, in- 
duced a markedly elevated GSR 
reaction, significantly different from 
the reaction on Trial 16. Here £ (57) 
= 2.9, p «.01. The size of the 
reaction was proportional to the 
amount of stimulus change (Fig. 2). 
A simple analysis of variance, com- 
paring seventeenth-trial GSRs to Ss, 
Ss, S4, and S; yielded an F (3; 56) of 
6.58, significant at the .005 level. 
When S returned to Sı after the 
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Change and geometric mean RT on Tdi * 
the first change trial, for subgroups © be 
assigned to the four change stimuli. e 
Stippled zones indicate the range ¢ ae A 
response measures over previous habituate! 
trials 6-16.) 


TABLE 1 


LATENCY OF RESPONSE AND GSR REACTIONS 


DURING HABITUATION AND CHANGE TRIALS 
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Habituation Change Trials 


Range, 
Juaséciagl SS y S an 


Mean GSR Log Conductance Change 


A 23 


.007 | .006 | .007 
S 33| 005-007 |.011 


010 | 006 | 007 


Geometric Mean Reaction Time 


ur 


23 0.4 
33 | 0.39-0.42 |046 | 0.4 


in Oa 


change trial, his GSR reactivity im- 
mediately returned to normal range. 
If S remained with his change stim- 
ulus, the return to normal range ap- 
peared to be delayed by a single trial 
(Table 1). 

Reaction-time data.—The trends ob- 
served in the reaction-time data were 
grossly similar to those found in the 
GSR data. Reaction time quickened 
from Trial 1 to Trial 16, with ¢ (59) 
= 3.71, p < .001. The extent of the 
change was from a geometric mean 
time of 0.46 sec. to 0.41 sec. On the 
first change trial, a lengthened RT 
was observed, differing significantly 
from the RT measure on Trial 16, 
with ż (58) = 3.59, p < .001. The 
return to normal range was rapid for 
Group A, somewhat slower for Group 
B; on Trial 20, Group B was still 
significantly slower than it had been 
on Trial 16, with £ (36) = 2.08, p < .05 
(Table 1). 

Figure 2 pictures the lengthening of 
RT on the first change trial. The 
differences in RT among the test 
stimuli were not supported by sta- 
tistical tests. An analysis of variance 
comparing mean RTs to S», Ss, S4, and 
Ss; on Trial 17 was not significant, 
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nor was another analysis comparing 
changes in RT from Trial 16 to Trial 
17 for subgroups of Ss assigned to the 
different change stimuli. 

Correlations of RT and GSR.— 
The first few habituation trials were 
marked by long RTs and large GSR 
reactions. These two response char- 
acteristics appeared during the first 
change trial as well. Analyses were 
therefore undertaken to see whether 
RT and GSR measures tended to be 
quantitatively correlated, 

The first such analysis looked for an 
association between an S's average 
levels of RT and GSR response. [t 
was felt necessary for this analysis to 
exclude Ss who rarely gave GSR re- 
sponses. Accordingly, 49 of the 60 Ss 
were selected out as “active” on the 
GSR during habituation, on grounds 
that these Ss gave a nonzero GSR 
on more than half of their scorable 
habituation trials. Each active S's 
average RT over his habituation trials 
was correlated against his average 
GSR reaction over those trials. The 
correlation was .28, significant at the 
-025 level, suggesting that individuals 
who tended to be more reactive on the 
GSR were to some extent also slower 
in RT. 

Would, then, moment-to-moment 
variations in a given individual's GSR 
responsiveness show a similar relation- 


TABLE 2 
FREQUENCY DISTRIBUTION OF 49 INDIVIDUAL 
CoRRELATION COEFFICIENTS RELATING 
GSR anv Loc RT DURING 
HABITUATION TRIALS 

| Value of Correlation 

|——— =. 

| 

| =55 | _ 19 | .08 to | .35 to 

| $95 | to .07 81 
o 2 
Observed |. 12 14 11 12 
go " 2 9 | 18 | 20 


Ship to his variations in speed of re- 
sponse? For each of the 49 active Ss, 
an individual correlation coefficient 
was computed, relating GSR and RT 
over his 16 habituation trials. The re- 
sulting coefficients varied widely from 
individual to individual (Table 2). 
Goodness-of-fit tests, using theoretical 
distributions of r calculated by Fisher's 
z (Walker & Lev, 1953), showed that 
such a distribution of r's would be 
unlikely to occur through sampling 
error from a rho of .28 (x? = 53:56, 
P «.001) However, the obtained 
distribution was not inconsistent with 
sampling error around a rho of 0. 
There was, then, no evidence for con- 
clusion that a given S's RT tended to 
vary from trial to trial with his GSR. 

Finally, a correlation was run com- 
paring Ss' GSR and RT measures on 
Trial 17, and involving the total of 59 
Ss for whom both measures were 
available on this trial. The correla- 
tion was .14, not significantly different 
from 0. 


Discussion 


Kimmel's (1960) study, and the pres- 
ent one, have shown that the GSR is 
responsive in a graded fashion to changes 
in quality and intensity of stimulation. 
As Allen, Hill, and Wickens (1963) have 
pointed out, this fact has significant 
implications for the interpretation. of 
studies of GSR generalization. Unless 
most response systems are as sensitive to 
stimulus change as is the GSR, which 
seems dubious, it is unlikely that GSR 
generalization can be well representative 
of the universe of generalizing behavior. 
This is a consideration of some impor- 
tance, since Hovland's (1937a, 1937b, 
1937c, 1937d) GSR generalization work 
has been a classic and important source 
of theory and data concerning stimulus 
generalization. 

GSR data may be atypical. For ex- 
ample, comparison of studies of intensity 
generalization (White, 1962) shows that 
Hovland's generalization curves are un- 
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like all others reported in that, testing 
towards louder tones, he found response 
strength ascending away from the train- 
ing stimulus. This effect, sometimes 
attributed to energizing properties of 
CS intensity, might involve also the 
responsiveness of the GSR to stimulus 
change. 

Recent data have suggested, in the 
case of voluntary response generalization, 
that generalization errors and correct 
responses are "ready" at different in- 
tervals of time after stimulus onset. 
Short-latency responses are apt to be 
generalization errors, while some sort of 
delay is associated with the delivery of a 
Correct response (White, 1965). 

In the present study, stimulus change 
caused a lengthening of response latency 
by about .04 sec. Five groups of gen- 
eralization Ss (White, 1965) had delay 
associated with a correct response aver- 
aging .07, .07, .08, .12, and .14 sec. (com- 
Daring geometric mean latencies of cor- 
tect responses and errors to generaliza- 
tion test stimuli). The delay induced by 
the perceptual disparity reaction 15 à 
little smaller, but of the same order of 
Magnitude, as the delay involved in 
delivery of a correct response in 2 
generalization test. 

However, it seems unlikely that the 
Perceptual disparity process, as it 1s now 
known, could act as the perfect com- 
plement to the generalization process. 
"he perceptual disparity response a 
short-lived ; it seems difficult to see how 
1€ could produce the gradual — 

€crements in generalized response which 
lave been consistently found TUM qe 
Peated testing. In addition, while the 
GSR component of the disparity E UN 
1S monotonically related to stimulus 
differences, the amount of RT see 
ing is neither appreciably correlated w tn 
the GSR, nor does it appear to b 
Monotonically related to extent of stim- 
ulus change. d i 

Possibly, the perceptual dis 
action may be a trigger proc 


parity re- 
ess which 


PERCEPTUAL DISPARITY 


sets into motion the antigeneralization 
process or processes which cause the 
decline of generalized responding with 
repeated experience with the test stimuli. 
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EFFECT OF PRIOR KNOWLEDGE OF THE STIMULUS 
ON WORD-RECOGNITION PROCESSES ! 


RALPH NORMAN HABER 


Yale University 


Giving S knowledge of the stimul 


us word immediately prior to its ex- 


posure increased the probability of S being able to perceive all of the 


letters of the word (p « .001). 
perception of the letters was four 
words (p « .001), this difference 


While a difference in probability of 
nd for rare as compared to frequent 
completely disappeared when S had 


prior knowledge of the word. Both of these findings seemed consistent 
with a response interpretation of word-recognition processes. How- 
ever, giving S repeated exposures of the word increased the probability 
of seeing the letters, regardless of whether he had prior knowledge of the 


word, a result interpreted as quite 


inconsistent with response processes. 


Further, examination of Ss' reports showed that they were perceiving 
letters, not making guesses about the word, and that the percept of the 
letters gradually increased in clarity, quite independently of whether 
they knew the word. 'The similarity of these results and conclusions 
to Hebb's notions of the development of cell assemblies and phase 


sequences was pointed out. 


An interpretation of word-recogni- 
tion thresholds based on response 
processes (e.g., Eriksen, 1958; Pierce, 
1963) should predict that if S had 
complete and exact information about 
the stimulus word, just prior to its 
presentation, all variation introduced 
by the probability of occurrence of the 
word would disappear. This predic- 
tion is based on the assumption that 
if S has full prior knowledge of the 
stimulus, there can be no further 
variation introduced by differential 
probabilities of having response items 
available, or of testing the correct 
"hypotheses" about the stimulus. 
The present experiment attempted 
to test several implications of this 

diction. 

a Ss were shown both rare and 
frequent English words, with the 


i rch was partially supported by 

sai ipit dle United States Public Health 

z ES MH-03244. Deep appreciation is 

i ed to Maurice Hershenson, whose 

"t irme with the author on earlier studies 

e This one possible. Special thanks are 

mane i to Martha Breen for her assistance 
nis collection and analyses of the data. 


number of exposures varied for each 
word, but not their duration or 
intensity. Half of the words were ex- 
posed, prior to their first flash, for 5 
sec., and S had to spell the word to £. 
In this way, for half of the words, 
regardless of their frequency in print, 
the response probability of S being 
able to select the correct word from 
all of the possible words in his vocab- 
ulary was 1.00, since he knew the 
word exactly. 

It was expected that the words for 
which Ss had prior knowledge would 
have a higher probability of having 
all of their letters perceived. Further, 
it was also expected that there would 
be no difference in probability of 
perceiving all letters between the rare 
and frequent words when S had prior 
knowledge of the word. Without such 
prior knowledge, a difference would be 
found, favoring the words that had 
appeared more frequently in print. 
Both of these predictions follow from 
a response interpretation of recogni- 
tion thresholds. 

A more difficult prediction concerns 
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the effects of repeated exposures of 
the word. Haber and Hershenson 
(1965) and Hershenson and Haber 
(1965) both found that the probability 
of perceiving all.letters of a word in- 
creased markedly with repeated ex- 
posures, even though there had been 
no increases in either duration or in- 
tensity of the stimulus. Since their 
Ss were reporting whether they saw 
the letters of the words, rather than 
whether they knew or could guess the 
word, they argued that the repeated 
exposures were affecting the develop- 
ment of a percept or image, rather 
than response probability or availa- 
bility. Therefore, a “perceptual” in- 
terpretation would predict that even 
when Ss have prior knowledge of the 
word, there will still be a gradual 
increase in the probability of perceiv- 
ing all of the letters with repeated 
exposures of the word. A response 
interpretation would have to predict 
that repeated exposures should have 
no effect on recognition, since Ss 
know what the word is. 


METHOD 


Each S was shown 576 seven-letter, three- 
Syllable English words, divided into p 
rare and frequent. The rare words: I 
selected from the rarest third of those appear 
ing in the Thorndike and Lorge on lled 
mary word count (G) and from wordi (1939) 
from Webster's Unabridged Dictionary g 
that did not appear in the Thorndike-Lorge 
lists at all. The frequent words were m 
the median on the summary np ri in 

orndike-Lorge lists, and repe ient 
tally MO enrii population of frequ 

“nglish words with this structure. . r 

, These TEN were randomly ien 
nine lists of 64 words, each list to bi Y the 
One of the nine experimental sessions of im 
experiment, Each list had an equal Ru 
of randomly ordered rare and frequent W n A 

The apparatus and viewing aman 
Were the same as in the two previous En * 
(Haber & Hershenson, 1965; ih 
Haber, 1965), The stimuli were presented in 
One channel of a three-channel mirror tachisto- 
Scope (Scientific Prototype Manufacturing 

Orporation, Model D). A second chianues 


serving as background, was always lighted, 
and contained two faint lines for fixation 
boundaries. The reflectance measured at the 
eyepiece with a Macbeth illuminometer was 
10 footlamberts (ftL.) for the background and 
18 ftL. for the stimulus on the background. 

The S was always prepared for the flashes, 
since he initiated each trial by pressing a 
button to trigger the tachistoscope. 

Procedure.—Each word was assigned one 
of two duration values (high or low), one of 
eight exposure trial numbers (1, 2, 3, 4, 5, 10, 
15, or 25), representing the number of times 
it would be flashed, and one of two conditions 
of prior knowledge (none or complete). 
Duration was never changed during the pres- 
entation of a word, regardless of the number 
of exposures. The complete prior knowledge 
condition was provided by exposing the word 
for 5 sec. in the channel before the first trial 
and requiring S to spell out the word to E. 
The order of the frequency of the words was 
random, as were the number of exposure trials, 
and the duration of the exposures. However, 
every odd word was exposed first for prior 
knowledge. Within each session of 64 words, 
one word was shown for each of the 64 pos- 
sible combinations of experimental variables. 

Each S was given three practice sessions 
before the nine experimental sessions. "These 
practice sessions included only frequent 
words, comparable to the frequent ones used 
in the latter session, with no prior knowledge 
given for any of them. The duration at which 
these practice words were exposed was sys- 
tematically varied for each S, so that two 
durations could be determined for use in the 
latter sessions. The lower duration was one 
where S never identified all seven letters on 
the first flash, but occasionally did so when 
the duration was 1 or 2 msec. higher. Once 
this lower duration was determined, the higher 
duration was set as 5 msec. above it. 

For each flash, in both the practice and 
experimental sessions, S reported the letters 
and their respective positions he was certain 
he perceived. The S was required to report 
the letters, even when he could identify the 
word. Allanalyses were based on the percep- 
tion of letters, not of words. This maximizes 
reports based on what S saw, rather than on 
what he thought he saw. The S was scored as 
having perceived a word if he correctly re- 
ported seeing all of the seven letters on the 
last of the exposures given for that word. 
The S was given no feedback on his accuracy 
for the words without prior exposure. Fur- 
ther, S did not know at the time he was mak- 
ing his report for any trial whether there 
would be further trials for that same word. 
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‘This unpredictability of further exposures was 
stressed to S, since otherwise S could withhold 


his reports until he was more certain. The S 
alw knew when a new word was to be 
presented. The S was given extensive in- 


structions regarding the importance of report- 
ing letters rather than words, and he was 
reminded of them when necessary. 

Subjects—Sixteen Yale undergraduates 
each served 12 hr. as Ss. They were tested 
individually, and had never been in a percep- 
tion experiment before. 


RES 


Figure 1 presents the results for the 
number of exposures (trials), duration 
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of exposure, frequeney of words, and 
prior knowledge. Since there ap- 
peared to be no interactions with 
trials, each S's scores were summed 
across trials, and the means of those 
summed scores are indicated by the 
points on the right of the figure. For 
each duration, both prior knowledge 
means are significantly above both 
While 
this is more true for the low duration, 
t (15) = 10.61, p < .001, than for the 
high duration, ¢ (15) = 4.10, p < .001, 


no-prior knowledge means. 
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als, 
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a difference between durations that ts 
also significant beyond .001, the con- 
vergence of the curves for the high- 
duration words was more likely to be 
due to a ceiling, which was clearly 
reached much earlier for the prior 
knowledge words. In any event, it is 
clear that giving S prior knowledge 
of the word increases the probability 
of his perceiving all of its letters. 

For both durations, the difference 
between rare and frequent words was 
significant only when S had no prior 
knowledge—Low D: ! (15) = 6.73, 
b «.001; High D: t(15) = 3.47, 
b < .001. Both ¢ tests yield values 
less than 1.00 between rare and fre- 
quent when prior knowledge had been 
ziven. Thus, the second prediction 
Was supported, that prior knowledge 
obliterates the difference between 
rare and frequent words. 

The findings regarding 
trials clearly support the “perce 
rather than the response interpreta- 
tion. This was seen most clearly for 
the low duration, where there was no 
ceiling. Here, all four curves showed 
nearly identical effects of trials, re- 
gardless of the frequency of the words 
or the prior knowledge of S. The 
Same tendency is apparent for the 
high-duration curves, even though the 
prior knowledge words reach the ceil- 
ing first. Consequently, even when S 
knows exactly what the stimulus will 
be, his ability to see all of the letters 
grows gradually with repeated Pe 
Posures, just as it does when he does 
not know the stimulus. : A 

For purposes of comparison with 
the two previous studies, least-squares 
solutions were obtained for each of the 
eight curves. The identical function 
was obtained for each of the eight, 
which was also the same function 
found repeatedly in the previous ex- 
Deriments, This was 


the effects of 
ptual" 


P.,—-A-— qi/n 


where 
P, is the probability of perceiving all seven 
letters by the nth trial. 
A is the asymptotic probability obtained 
after 25 trials. 
qı is the probability of zot perceiving all 
seven letters on the first trial. 
n is the number of trials. 


Each least-squares fit indicated that 
the exponent of z did not differ from 
1.00 except by sampling error, but was 
significantly different (p < .001) from 
zero (a formal test of the significance 
of the effects of trials). 


DISCUSSION 


These results indicate that the effects 
of word frequency are probably mediated 
by response processes, since these effects 
are removed when a principal source of 
variance due to response processes is also 
removed. However, such response proc- 
esses probably do not account for the 
more basic perceptual behavior itself. 
While it seems intuitively reasonable that 
knowing what the stimulus will be should 
make it more easily perceived, this makes 
sense only if perceiving implies the ability 
to make some kind of probabilistic re- 
sponse of tentative recognition or identi- 
fication. However, the rather slow 
growth in the actual ability of S to see 
the letters of the word, even when he 
knows exactly what the word is, does not 
seem to be a change in probabilistic 
responses. 

This is most clearly illustrated by 
examining the nature of Ss' reports about 
their percepts. On the first flash of a 
word, S generally reported seeing nothing 
at all—no letters or pieces of letters. 
After several flashes, first pieces and then 
whole letters would be visible, and after a 
few more presentations, all of the letters 
would be perfectly plain and clear. The 
Ss, of course, could correctly guess the 
word well before all of its letters were 
perceived, but even after they had 
guessed it, there was a continued gradual 
development of the percept of the rest of 
the letters. These percepts became in- 
creasingly clear and distinct, so that 
reports after a number of presentations 
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were not guesses or "hypotheses," but 
were based on quite an explicit and un- 
ambiguous image of the word. Thus, 
while it is possible that response processes 
might be controlling the guessing of the 
word, especially from images without all 
of the letters present, it seems quite un- 
likely that response processes are also 
controlling the gradual development of 
the percept of the word. 

Another line of evidence supports this 
conclusion. The effect of trials, that is 
the slow growth in the probability of 
being able to perceive all letters as a 
function of repeated presentations, has 
now been found in three different experi- 
ments, under a variety of conditions. 
For example, this effect holds for frequent 
English words (this study, Haber & 
Hershenson, 1965, Hershenson & Haber, 
1965), for rare English words (this 
study), for totally unfamiliar nonsense 
words (Hershenson & Haber, 1965), and 
words for which S knew their exact 
content immediately prior to exposure 
(this study). Further, the mathematical 
equation specifying the function of ex- 
posure trials to perceiving the words has 
been the same for each experiment and 
for each condition within each experi- 
ment. Thus, it would seem that Ss’ 
knowledge or past experience with the 
words (the stuff of which response proc- 
esses would be made) is fairly irrelevant 
to the development of a percept. 

These three experiments, even taken 
together, do not suggest what mechanism 
would account for the growth of a per- 
cept. However, such a growth is remark- 
ably similar to Hebb's (1949) discussion 
of the growth of a cell assembly, and the 
organization of cell assemblies into phase 
sequences. If this analogy is correct, 
then the development of such assemblies 
could be considered independently of 
previously established assemblies. That 
is, the speed of associating the compo- 
nents of an assembly would be independ- 
ent of whether such an assembly had 
previously been established, even though 
the number of components with which 
one begins might be determined by the 
existence of previous assemblies, as well 
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as other variables. In this sense, the 
intensity or duration of the stimulus, and 
the amount of S's prior knowledge or 
experience with that stimulus, would 
determine how many components are 
available at the beginning of the develop- 
ment of an assembly, under some kind 
of assumption that the more elements 
available, the faster or more completely 
the assembly develops. These variables 
might also determine the maximum 
degree of development, such that if the 
duration or intensity were too low, or the 
previous assemblies too incomplete, then 
the current stimulus would not lead to a 
fully organized assembly, which implies 
that the perceiver would never fully be 
able to see all parts of the stimulus. 

The analogy to Hebb’s theorizing is 
useful, not because these data provide 
any kind of test of his work, but that 
there seems to be some kind of formal 
similarity between the concepts of growth 
in perceptual experience (ontogenesis) 
that Hebb speculates about, and the 
development of a percept (microgenesis) 
that was actually measured in this 
experiment. 
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It has been found that delay of 
reinforcement, among other factors, 
influences the strength of a condi- 
tioned motor response (CMR). The 
relationship is inverse, and the as- 
Sumed mechanism has been drive- 
stimulus reduction (Hull, 1943). 
Some advocates of this explanation 
Maintain that drive-stimulus reduc- 
tion is also responsible for conditioning 
Of fear responses. On the other hand, 
Mowrer (1950) posited drive-stimulus 
reduction for CMRs and contiguity 
for conditioned fear responses (CFRs). 

Whether drive-stimulus reduction 
affects learning of fear responses, might 
Perhaps be studied by manipulat- 
ing CS-onset UCS-termination delays 
(shock-termination delays, 
f in fear conditioning, sho 
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al designs were used to study the effects of 
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CS-onset UCS-termination delays of 
16, and 32 CS-UCS pairings were used in 
UCS duration was constant in Design I, 
c. CS-onset UCS-onset interval was con- 


in terms of running speed to the goal 


into which the CS, light previously paired 


Fear strength and CS- 
elated, a finding inter- 


bstruction. 
re inversely r 


process theory. The other factors were 


in the case of CMRs. The supposition 
might then be that the same mechan- 
ism is involved in the learning of both 
kinds of responses. Whereas some 
writers (Runquist & Spence, 1958; 
Strouthes & Hamilton, 1959) found 


CFR and STD to be inversely related, 
others (Mowrer & Aiken, 1954) 
did not. 


Conducting an experiment in which 
STD is varied necessarily involves 
two other variables, i.e., CS-onset 
UCS-onset interval, (ISI), and UCS 
duration. For, by virtue of the 
physical arrangements, when STD is 
manipulated, either of these two vari- 
ables can be made to vary system- 
atically while the other remains 
constant. 

In addition, the question of the 
effect of the number of CS-UCS paired 
presentations on CFR has not yet 
been satisfactorily answered. Some 
investigators (Gwinn, 1951; Kalish, 
1954; Libby, 1951; Strouthes & 
Hamilton, 1959) have found a positive 
relation between CFR and condition- 
ing trials, others (Mathers, 1957) have 
not, and still others (Goldstein, 1960; 
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and UCS arrangements in Designs I and It. 
(The three levels of CS-onset UCS-termina- 
iion delay in the two designs are the same— 
.58, 1.10, 2.20 sec. In Design I, the UCS 
duration is constant, .25 , While the CS- 
onset UCS-onset interval is invariant, .05 sec., 
and the UCS duration necessarily changes.) 


Libby, 1951) have showed a curvi- 
linear relation. In the present investi- 
gation, this factor was also studied. 
It was made to constitute the vertical 
dimension in the two designs; it is not 
shown in Fig. 1. 

Thus the problems under investiga- 
tion were: (a) To find the effect of 
CS-onset UCS-termination interval, 
and (b) of the number of CS-UCS 
paired presentations on CFR. (c) To 
make some inferences regarding the 
effects of UCS duration, and ISI 
on CFR. 


METHOD 
Subjects and Design 


The Ss in the two 3 X 3 factorial designs 
were 120 Wistar male albino 90-day old rats. 
They were randomly assigned to 15 subgroups. 
(Since 3 of the 9 subgroups in Design I were 
identical with 3 of the 9 subgroups in Design 
II, only 15 subgroups of eight rats each were 
used.) - The horizontal dimension in Fig. 1 
represents the three STD variations which are 
referred to as Groups AI, BI, CI for Design I 
and AII, BIH, and CII for Design II. 

In both designs the STD values were .55, 
1.10, and 2.20 sec. In Design I a .25-sec. 
UCS duration was held invariant, whereas 
the ISI varied. Thus, the ISIs were neces- 
sarily .30, .85, and 1.95 sec. In Design II the 
ISI was held constant at .30 sec., and the 
UCS duration necessarily became .25, .80, and 
1.90 sec. The CS-UCS paired presentations 
in both designs were 8, 16, and 32. 


Apparatus 
Runway.—A straight 48 X 4x 4 in. run- 
way with attached 7 X 7 X 4 in. start and 
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boxes was used. The boxes were 
separated from the runway proper by guillo- 
tine doors. A 7-w. bulb was mounted within 
the runway proper, 164 in. from the food cup 
or end wall of the goal box allowing 31 in. 
clearance from the floor of the runway. The 
light, off during training trials, could be made 
to blink at 16 cps during the test trials. Time 
was measured by a chronoscope controlled by 
the weight of the animal on two plates flush 
with the floor of the runway. They were 
located just outside the start box and in the 
goal box, 47 in. apart. The runway, the 
start, and goal box were painted flat white, 
flat gray, and flat black, respectively. 

Shock box.—The Ss were shocked in a 
17 X 5 X 7 in. wooden box. A 16-cps blink- 
ing light from a 7-w. bulb mounted so that its 
bottom touched a Plexiglas ceiling 3} in. 
above the grid served as CS during fear con- 
ditioning. A Stoelting automatic timer con- 
trolled the lengths of the CS, UCS, and the 
interval between them. The UCS, 290 pa. 
electric shock, was supplied from an auto- 
matic compensator. It consisted of inter- 
rupted direct current with on and off dura- 
tions of .014 and .041 sec., respectively. 


goal 


Procedure 


Runway lraining.—On the first day at 48 
hr. food deprivation, each rat was put in the 
start box for 20 sec. The S was then lifted 
out and placed in the goal box whence it was 
removed when the three 4 X 3.3 mm., 45-mg. 
Noyes tablets were consumed. This proce- 
dure was repeated two more times after which 
the running trials began. There was a total 
of 37 such trials: 2 on Day 1, 5 on Day 2, and 
10 on each of 3 subsequent days. "Twenty 
minutes after the final daily trial, S was 
allowed a 30-min. feeding. The Ss were run 
in virtual darkn 

Fear conditioning.—For CS-UCS relations 
see Fig. 1. Concerning CS-UCS pairings, 
since the groups received varying numbers, 
a procedure was adopted which allowed equal 
handling of Ss and constant time interva 
between the last fear conditioning trial and 
testing. Thus, Ss in the eight pairings groups: 
were fear conditioned on the last 2 fear con- 
ditioning days, whereas Ss assigned to 16 were 
conditioned on the last 4 days. On days be- 
fore fear was conditioned each S was simply 
placed in the shock apparatus for 5 min. 
without light or shock. 

Testing—The operations during testing 
were the same as during runway training 
except for the presence of the CS, which 
began blinking as the start-hox door was 
raised, and went off when the animal reached 
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the goal. A 2-min. criterion and a 1-min. 
intertrial interval were adopted. The Ss were 
tested on 3 consecutive days at 10, 10, and 
5 trials daily. Differences in running speeds 
were taken as an index of the differential CFR 
strength acquired during fear conditioning. 


RESULTS 
Design I.—All data were trans- 
1 
formed into running speeds, E 


X 100,000. Comparisons of the mean 
speeds based on the last four runway 
training trials for the nine subgroups 
resulted in nonsignificant Z5. A 
similar analysis of the first test-trial 
data showed that mean running 
speeds differed significantly, F Q, 63) 
= 3.30, p < .05, with respect to STD 
only, As shown in Table 1, the 
differences between Groups Al and 
BI, and AI and CI were significant at 
b =.05 and p «.05, respectively. 
There was no difference between BI 
and CI. This effect is shown in Fig. 2 
Where Ss from the two longer STD 
groups, BI and C1, reach the goal box 
faster than those from Al. There 
Were no differences, with regard to 
this variable beyond the first test 
trial. The groups did not differ ac 
Cording to CS-UCS pairings on any 
trial, 

Design II.—The statistical proc 
dure used to analyze the data I 

esign I] was similar to that enr 
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TABLE 


1 
CES WITH | 


RESPECT T 


Mean Speed 
Values by Groups 
‘Appearing in 
Column tand | P 5 
Presented in 
That Order 


Groups 


Design Resin 
1 


AT BI 93.54 133.29 | 39.75 
Al CI 93.54 140.95 | 47.41 
BI CI 133.29 140.95 7.66 


Design Design 
Il 1I 


All BII 93.54 86.25 7.29 — 
AIL CII 93.54 132.87 | 39.33 05 
BIL CH 86.25 132.87 | 46.62 «.05 
Design Design 
AT BII 93.54 86.25 7.29 — 
Al CII 93.54 132.87 | 39.33 .05 
BI AIL 133.29 — 93.54 | 39.75 -05 
BI BII 133.29 86.25 | 47.04 «.05 
BI CII 133.29 132.87 42 — 
CI All 140.95 — 93.54 | 47.41 | <.05 
CI BII 140.95 86.25 | 54.70 OL 
CI cil 140.95 132.87 8.08 — 


ployed in Design I. As in Design 1, 
(a) the groups were not reliably 
different at the termination of runway 
training. (b) Mean speeds for Test 
Trial 1 showed that STD was the 
only significant source of variation, 
F (2, 63) = 3.23, p < .05. As shown 
in Table 1, there was no difference 
between AII and BII, but there were 
differences between All and CIT and 
between BII and CII at p = .05 and 
p «05, respectively. According to 
Fig. 2, the longest STD group (CII) 
ran the distance faster than either 
All or BII. As in Design I, no other 
significant differences were found. 
The identical conditions under 
which the two designs were conducted 
made interdesign comparisons pos- 
sible. Thus, the mean speed differ- 
ences for STD groups on Trial 1 were 
tested. These differences and the p 
values appear in Table 1. Group 
comparisons across designs show that 
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(a) the groups with the shortest STD 
in either design were significantly 
different from those with the longest, 
and (b) the combination of inter- 
mediate STD (1.10 sec.) and inter- 
mediate shock duration (.80 sec.) 
Group BII, led to slowest running 
speeds. This group differs at P< .05 
from its corresponding Group BI in 
Design I, and at p = .01 from the 
longest STD group, CI, also in 
Design I. Furthermore, Groups AI 
and CII differ at = .05, and Groups 
CI and AII at p < .05. 


Discussion 


CS-onset UCS-termination delay, UCS 
duration, ISI.—While increments in STD 
were identical in both designs there were 
also concurrent increments of ISI in De- 
sign I and of UCS duration in Design II. 
Therefore, no unequivocal inferences 
could be drawn from either design sepa- 
rately. However, across design group 
comparisons made some conclusions more 
definite. Thus, overall comparisons 
show that where given STD groups 
differed reliably the direction of these 
differences could, generally, be considered 
the same in both designs. Thus, running 
speeds increased with longer STD, longer 
ISI in Design I and with longer STD, 
longer UCS duration in Design II. If 
ISI is assumed to play the important 
role, then the outcome in Design I was 
in the expected direction, However, in 
Design II even though the ISI Was con- 
stant, the groups still differed. Alter- 
natively if we assume UCS duration to 

play the important role in fear condition- 
ing, then in Design II CFR strength 
might be expected to increase às a func- 
tion of UCS duration. Yet this was not 
the case. Furthermore, there was no 
difference between CI and CII though 
Ss in the latter group were given almost 
eight times as long a shock as those of the 
former. The difference between BI and 
BII might be attributed either to a 
stronger CFR due to longer UCS in 
Design II or a weaker CFR due to longer 
ISI in Design I. The second possibility 
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seems unlikely since in CII for instance, 
the ISI was even shorter than that of BI 
yet the CFR was weaker. Thus, what- 
ever the effects of UCS duration and of 
ISI may have been they were weak at 
the third delay level and seem to have 
been offset by stronger effects of a “more 
important" factor, CS-onset UCS-termi- 
nation delay. 

In Design I, both BI and CI differed 
from AI. If ISI was responsible for the 
effects then the three groups in Design 
II should not differ, since the ISI was 
constant. Similarly, in Design II, both 
AII and BII differed from CII which had 
the longest UCS duration. In both 
instances these results may mean that 
STD is more important in fear condition- 
ing than ISI or shock duration. BII 
Seems to deviate from this general 
formulation. This finding cannot be 
explained readily, At best, perhaps the 
Particular UCS duration and ISI as well 
as the STD for that group make for 
“optimal” conditions, 

TheSTD gradient, as evidenced by the 
difference in running speed between the 
Shorter and the longer STD groups, 
Shows a steep decline. Itis in agreement 
with the theoretical postulations by Hull 
(1952, p. 131). 

Thus the following generalization 
Seems parsimonious: 
tween CS-on and UCS. ff 
the most important 
experiment, 
stronger CFR 
rapid CFR diminution, 
gradient. (c) When STD is “long” the 
effect of UCS duration 


However, (d) a 


d ly "long" shock, 
[, bring about Strongest CFR, 


STD. (e) ISI d 


effective as STD or UCS 


present study, STD interval 


from the 


Á acquire the UCS prop- 
erties. 


es. The findings Support those by 
Sullivan (1950) and Runquist and Spence 
(1958). Thus, it may be concluded that 
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drive-stimulus reduction affects learning 
of responses, be they motor or fear, and 
so perhaps no other principle is needed. 
Number of CS-UCS paired presenta- 
tions—As noted earlier, the relation 
between CS-UCS paired presentations 
on CFR has not yet been established. 
In this study no significant group differ- 
ences have been obtained. It is possible 
that with 40 reinforced runway training 
trials the strength of the approach re- 
sponse masked any differences due to 
CFR. It is also possible that, in part at 
least, the CS duration may have been 
responsible. For, whenever the CS went 
on during fear training, it lasted .25 sec. 
only, whereas on any test trial the CS, 
acting as obstruction, lasted a consider- 
ably longer time, thus enhancing CFR 
extinction which in turn "erased" any 
group differences due to pairings. 
Though there were no significant group 
differences there was a strong suggestion 
that as CS-UCS pairings increased be- 
yond a certain number the CFR de- 
creased. Current work supports the 
desensitization explanation. 


REFERENCES 


GorpsrEIN, M. L. Acquired dri 
a joint function of shock it 
number of acquisition trials. 
chol., 1960, 60, 349-358. 


ve strength as 
ntensity an 
J. exp. Psy- 


291 


Gwinn, G. T. Resistance to extinction of 
learned fear drives. J. exp. Psychol., 1951, 
42, 6-12. 

Hutt, C. L. Principles of behavior. 
York: Appleton-Century-Crofts, 1943. 

Hutt, C. L. A behavior system. New Haven: 
Yale Univer. Press, 1952. 

Karrsg, H. I. Strength of fear as a function 
of the number of acquisition and extinction 
trials. J. exp. Psychol., 1954, 47, 1—9. 

Lipsy, A. Two variables in the acquisition 
of depressant properties by a stimulus. 
J. exp. Psychol., 1951, 42, 100-107. 

Matuers, B. L. The effect of certain pa- 
rameters and the acquisition of fear. J. 
comp. physiol. Psychol., 1957, 50, 329-333. 

Mowrer, O. H. Learning theory and per- 
sonality dynamics. New York: Ronald 
Press, 1950. 

Mownzn, O. H., & Aiken, E. G. Contiguity 
vs. drive-reduction in conditioned fear: 
Variations in conditioned and unconditioned 
stimuli. Amer. J. Psychol., 1954, 67, 26-38. 

RuNQuisr, W. N., & Spence, K. W. Tem- 
poral effects of conditioned fear on the eye- 
lid reflex. J. exp. Psychol, 1958, 55, 
613-616. 

Srroutues, A., & HawiLTON, H. C. Fear 
conditioning as a function of the number 
and timing of reinforcements. J. Psychol., 
1959, 48, 131-139. 

SuLLIVAN, J. J. Some factors affecting the 
conditioning of the galvanic skin response. 
Unpublished doctoral dissertation, State 
University of Iowa, 1950. 


(Received January 22, 1964) 


New 


Experimental Psychology 
is va. 69 No. 3, 292-297 


EFFECTS OF DRIVE, REINFORCEMENT SCH 
AND CHANGE OF SCHEDULE ON PERFOR} 


EDULE, 


TANCE ! 


PIETRO BADIA 


Bowling Green Stale U; niversity 


indicated that high- and medium-drivı 


shifted to partial incre: 
but under low drive n 
occurred for high- 
continuous reinfor 


ased speed of rı 


and medium-drive, 


recent theoretical statements. 


The present study was designed to 
investigate the effects of drive level 
and partial reinforcement on behavior 
during the acquisition of a running 
response. A second purpose was to 
determine the effects of shifting from 
partial to continuous reinforcement in 
some animals and from continuous to 
partial reinforcement in others, 

Several studies (Goodrich, 1959; 
Wagner, 1961) have provided support 
for the theoretical prediction of Amsel 
(1958) and Spence (1960) that par- 
tially reinforced Ss will perform at 
higher levels than continuously rein- 
forced Ss. The basis for this pre- 
dicted superiority is that nonreinforce- 
ment and the subsequent development 

of conditioned frustration (rj) results 
in an increased motivational level. Jt 
is not known, however, how drive 
affects the development of rj. Ap- 
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parently, only two studies have been 
reported dealing with partial rein- 
forcement and drive level (Lewis & 
Cotton, 1957; Linton & Miller, 1951). 
Drive was manipulated only during 
extinction in the Linton and Miller 
study ; in the Lewis and Cotton study 
training was terminated before asymp- 
totic measures were obtained and in- 
stead of response segments, total 
running time was recorded, 
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plastic floor was scored from end to end with 
.25-in. squares and sandblasted. 

Filtered (Wratten 25A) light beams were 
directed from above the maze to photo- 
electric cells below the maze. Both the light 
sources and photocells were on tracks. The 
apparatus was located in a sound-deadened 
room illuminated by one 73-w bulb with a 
random noise generator providing white noise 
for masking purposes. The maze was illumi- 
nated by four 7}-w. bulbs along each side 
with the illumination level maintained at 40% 
by a variac. Starting, running, and goal 
measures were provided by three timing cir- 
cuits activated by the interruption of the 
filtered photocell light sources. The first 
running time was measured from the inter- 
ruption of the first beam 4 in. from the start 
door to the interruption of the second beam 
16 in. from the start door and 12 in. from the 
lirstbeam. ‘The second running-time measure 
Was taken from the interruption of the second 
beam to the interruption of the third beam 28 
in. from the start door and 12 in. from the sec- 
ond beam. Goal-box time was measured from 
the interruption of the third beam to inter- 
ruption of the fourth beam 40 in. from the 
start box and 12 in. from the third beam. 
This last beam was 7 in. from the end of the 
runway and 6 in. from the food cup- ! 

Design.—Three levels of drive and two 
Percentages of reward were combined in a 
2X3 factorial design. The drive levels 
employed were 23 hr., 115 hr» and 225 hr., of 
deprivation, In addition the high-drive Ss 
(223 hr.) were kept at 85% of their, ad-lib. 
body weight. The percentages of reinforce- 
ment employed at each level of drive were 
100 and 50%. Twelve Ss were randomly 
assigned to each of the six cells of the 2 X 3 
design, However, due to sickness (a S) or 
failure to run (4 Ss) 5 Ss were discarded, 4 ol 
which were replaced by reserve animals on 
"and for this purpose, resulting in 1 S 
each cell except for the high-drive (224 Y^ 

0% reinforcement cell which had 11 a " 

Prelraining.—Vifteen days before b i 
acquisition trial all Ss were placed on à 2 " z i 
lood deprivation schedule and continuec or 
a throughout the course of the experiment. 


x . ainin: 
“ive days prior to experimental trati 2 


cack pro to 45-mg. pellets 
ch S was introduced P t these pellets 


(Noyes) by receiving 
immediately before his regularly achediled 
meal. On Day 2 this ration Wes pem ts 
20 pellets, on Day 3 to 10 pellets, and Er ae 
Were then given on the remaining ^ ee 
Wo days prior to the first aomi e 
each S was allowed 5 min. tO explore t 
runway, 
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Experimental preshift training.—The Ss 
were tested in squads of six, one S from each 
of the experimental groups. The Ss in the 
100% groups were reinforced on all trials 
with four pellets (Noyes). The Ss in the 
partially reinforced groups (50%) also 
received four pellets each in the follow- 
ing sequence of reinforced and nonrein- 
forced trials: +—+—, +—+, +--+, 
—++ —. On Days 1 and 2 only one trial 
a day was given to each S, on Day 3 two 
trials were given, and beginning on Day 4 and 
thereafter all Ss were given four trials a day. 

After S was placed into the starting box, 
during a training trial, the vertical door was 
raised when S faced it and 2 sec. later, the 
horizontally sliding clear plastic door was 
opened. When S left the start box, the 
vertical door was lowered to prevent retracing. 
When S entered the goal box, the vertical 
door was lowered and after eating the food, 
removed. On nonreinforced trials 5 was re- 
moved after 10 sec. After all Ss had received 
1 trial the sequence was repeated until each 
had 4 trials. About 45 min. after the experi- 
mental session was completed, Ss were re- 
turned to their home cage. A total of 88 
acquisition trials was given each S. 

Experimental postshift training.—After the 
88 acquisition trials the percentage of rein- 
forcement was reversed for all Ss. The 100% 
reinforced groups were reduced to a 50% 
reward schedule and the 50% reinforced 
groups shifted to a 100% reward schedule. 
‘An additional 32 trials were then run under 
the conditions that prevailed prior to the 
shift. 


RESULTS 


The running and goal speeds were 
transformed to reciprocals and the 
performance measures were separately 
analyzed for the three phases of 
training: initial preshift trials (1-20) ; 
terminal preshift trials (69-88) ; post- 
shift trials (101-120). 

Preshift trials —From Tables 1, 2, 
and 3 it appears that in all three pre- 
shift response measures during the 
early trials (1-20), continuous rein- 
forcement produced faster running 
speeds than partial reinforcement 
while high drive produced faster 
running speeds than medium drive, 
which, in turn, produced faster run- 
ning speeds than low drive. For all 
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TABLE 1 


MEAN RUNNING SPEED IN THE First SECTION 
OF THE ALLEY AS A FUNCTION OF DRIVE, 
PERCENTAGE OF REINFORCEMENT, 


AND TRIALS 
i Trial: Trials | Trials 
Dad Rosen 120 | 69-88 101-1202 
: 100 1.767 | 3.335 | 3.358 
High 


50 1.402 | 3.493 | 3.486 


of drive and two percentages of 
reinforcement were separately anal- 
yzed for the three response measures 
over the initial trials (1-20) and 
terminal trials (69-88). The separate 
analyses for the initial trials showed 
that: (a) percentage of reinforcement 
was significant for the first, second, 
and goal-speed measures, F (1, 65) 


; 100 | 1.462 | 2.870 | 3.125 
Medium ede «il 4930 |3169 3.179 


p 100 1.275 | 2.307 | 2.390 
zw 50 1.033 | 2.247 | 2.365 


a Postshift trials. 


three response measures the higher 
drive groups appeared to have main- 
tained their superiority over the lower 
drive groups in late acquisition train- 
ing also. However, at the terminal 
phase of training for all drive levels, 
the marked superiority of continu- 
ously reinforced groups over the par- 
tially reinforced groups was consider- 
ably reduced for the goal-response 
measure, eliminated for the second 
Speed measure, and apparently re- 
versed for the high- and medium-drive 
groups in the first section of the run- 
way (first speed measure). 

Mean running speeds at three levels 


TABLE 2 


MEAN RUNNING SPEED IN THE SECOND 
SECTION OF THE ALLEY AS A Function 
OF DRIVE, PERCENTAGE oF 


REINFORCEMENT, 

AND TRIALS 
rive Percent | Trials Trials Trials 
real Reward | 1-20 69-88 | 101-1208 


; 100 | 1.900 | 3271 | 3.480 
High 50 | 1.522 | 3:372. | 3344 


. 100 | 1.605 | 2.859 | 3.052 
Medium | "so | 1492 | 3.101 | 3:085 


100 | 1.374 | 2.315 
Low 50 1.161 | 2.254 


| 2477 
2.366 


a Postshift trials. 


— 32.60, 28.06, and 29.51, respec- 
tively, p < .005; (b) drive level was 
significant for all three response meas- 
ures, F (1, 65) = 26.36, 26.25, 18.66, 
P < .005; (c) the interaction between 


TABLE 3 


MEAN RUNNING SPEED IN THE GOAL SECTION 
OF THE ALLEY AS A Function or Drive, 
PERCENTAGE oF REINFORCEMENT, 


AND TRIALS 
Drive Percent | Trials Trials Trials 
Level | Reward | 1-20 69-88 | 101-120" 
High 100 | 1.642 | 3.151 | 3.261 
Bn 30 | 1.286 | 3:001 | 3074 


- 100 | 1424 | 2715 | 2721 
Medie sene |. dpa 2.813 | 2:879 


í 100 | 1177 | 2186 | 2214 
Low 50 | 1.046 | 2039 | 2238 


? Postshift trials, 7 


orcement and drive 


ficant on any of the 
three response measures, 


Ww trials —The analyses for 
the terminal 


34.08, 

42.84, respectively, p < .005; (c) in 

was there a gj 

reinforcement 
teraction, 

As seen from Tables 1, 2, and 3, the 

net effect of shifting the percentage 
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of reinforcement was a convergence 
of the postshift running speeds for the 
continuously and partially reinforced 
Ss. Since terminal preshift asymp- 
totic performance levels between con- 
tinuously and partially reinforced Ss 
did not differ significantly, it is obvi- 
ous that the converging postshift run- 
ning speeds would not differ. How- 
ever, differential within-group eflects 
resulting from shifting the percentage 
of reinforcement did occur. Under 
conditions of high- and medium-drive 
levels, continuously reinforced Ss 
shifted to a partial schedule, increased 
their speed of responding on the first 
and second running speed measures, 
but not on the goal-speed measure. 
However, under conditions of low 
drive, little change resulted from the 
shift on any of the response measures. 
It also appears from Tables 1, 2, and 3 
that under conditions of high an 
medium drive there is little effect of 
shifting partially reinforced Ss to a 
continuous schedule. But under con- 
ditions of low drive, shifting from a 
Partial to continuous schedule re- 
Sulted in an increase in running speed 
9n all three response measures 
. Since separate statistical analyses 
involving percentage of reinforcement, 
drive level, and pre- and postshift 
trial blocks (last 20 pre- and postshift 
trials), resulted in a significant triple 
interaction for the first and seeond 
Speed measure, separate analysis o 
the preceding observations was com- 
Puted for all three response measure. 
The results of these within analyses 
of partially reinforced 55 shifted to 
Continuous reinforcement showed no 
significant. differences between Pre- 
and postshift speed measures under 
Conditions of high and medium drive. 
Under conditions of low drive par- 
tially reinforced Ss, shifted to 2 con- 
tinuous schedule, did increase their 
Speed of responding on all three re- 


sponse measures, F (1, 11) = 11.05, 
p < .025; F (1,11) = 19.38, p < .005; 
and F (1,11) = 31.72, p < .005 for 
the first, second, and goal-speed 
measures, respectively. 

The analyses of the high-drive, con- 
tinuously reinforced Ss shifted to a 
partial schedule showed that: (a) pre- 
and postshift running speed differ- 
ences for the first response measure 
were significant, F (1,10) = 12.14, 
p « .005, reached significance in the 
second response measure, F (1, 10) 
— 6.32, p — .025, but did not attain 
significance for the goal-response 
measure, F (1, 10) = 3.35, p < .10; 
(b) under conditions of medium drive 
also the postshift running speeds were 
superior to preshift speeds for the first 
and second response measures yielding 
F (1, 11) = 1415, p < .005 and 
F (1, 11) = 12.83, p < .005, respec- 
tively, but did notfapproach signifi- 
cance for the goal-response measure 
(F <1); (c) with none of the three re- 
sponse measures did low-drive Ss show 
a significant change in speed of re- 
sponding when shifted from continu- 
ous to partial reinforcement. 


Discussion 


Several of the findings in this study 
were expected. The second-order inter- 
action between percentage of reinforce- 
ment and trials predicted by the theories 
of Amsel (1958) and Spence (1960), re- 
ported by other investigators (Goodrich, 
1959; Wagner, 1961), and obtained in 
this study was significant in both the 
pre- and postshift phases of training. 
This interaction indicates that percent- 
age of reinforcement affected the rate of 
acquisition training. But in none of the 
preshift performance measures was there 
a significant triple interaction. Failure 
to obtain this interaction suggests that 
the differential effects of percentage of 
reinforcement on performance are in- 
dependent of drive level. It should be 
noted in regard to this latter finding that 
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there is nothing in the theories cited 
(Amsel, 1958; Spence, 1960) to suggest 
findings contrary to this. 

The acquisition running speeds for the 
first and second measures follow a pat- 
tern similar to that reported by others 
(Goodrich, 1959; Haggard, 1959; Wag- 
ner, 1961). Although Partially rein- 
forced groups performed at an inferior 
level early in training, it is clear that 
following extended training they attain a 
level of performance if not greater, at 
least equal to, continuously reinforced 
Ss. Despite the fact that the apparent 
reversal of initial order for the high- and 
medium-drive groups was not significant, 
the direction is consistent with that of 
other studies where statistical support 
was obtained (Goodrich, 1959; Wagner, 
1961). These results provide some sup- 
port for the theoretical formulation of 
Amsel (1958) and Spence (1960). 

Contrary to the preshift findings, the 
analysis using pre- and postshift trial 

blocks indicated a lack of independence 
between percentage of reinforcement and 
drive level. While the shift from con- 
tinuous to partial reinforcement resulted 
in an increase in running speed for high- 
and medium-drive groups on the first two 
response measures, no increase occurred 
for the low-drive group. In contrast to 
these findings, the shift from partial to 
continuous reinforcement led toa signifi- 
cant increase in running speed for the 
low-drive group, but not for high- and 
medium-drive groups. It should be 
noted that the earlier preshift relation- 
ship between percentage of reinforce- 
ment and drive level, though lacking 
statistical support, was in the same 
direction as the latter findings. 

The findings of the postshift analysis 
suggest two possible extensions of the 
theoretical statements of Amsel (1958) 
and Spence (1960) regarding the condi- 
tioned form (rs) of the hypothetical 
emotional response termed frustration. 
In reference to the first extension, the 
findings of the present investigation 
imply that under conditions of low drive 
and small reward 7; may develop only 
minimally or not at all. Only under 


intermediate or high drive may ry 


develop suffciently to affect behavior. 
At present the theoretical statements 
offered by Amsel (1958) and Spence 
(1960) cannot adequately explain these 
data in that they assume ry, resulting 
from nonreinforcement, simply to be a 
function of the vigor or strength of rg. 
The development of fo in turn, is 
considered to be independent of drive and 
dependent only upon the characteristics 
of the reinforcement employed, such as 
magnitude and delay of reward. Since 
partially. reinforced low-drive Ss when 
shifted to continuous reinforcement 
Showed a significant increase in running 
speed, while those shifted from continu- 
Ous to partial reinforcement did not, 
extension of these theoretical statements 
to include the drive dimension in regard 
to the development of 7; Or r, may be 
necessary, 

The second extension suggested by the 
findings deals with the extinction of the 
rs response. Since the acquisition of ry 
is based upon the laws of classical condi- 
tioning, Presumably these laws should 
Y to the extinction of rj. Since 
ing the unconditioned Stimulus 
leads to the extinction of the conditioned 
response, eliminating the unconditioned 
stimulus (in this case, nonreinforcement) 
should result in the extinction of 7; and 
loss of the motivational component con- 
tributed by it, A decrement in response 
strength following the extinction of ry 
should reflect this reduced motivational 
level. 

The results of this investi 
in agreement with the above theoretical 
outline. While Ss shifted from con- 
tinuous to partial reinforcement (32 
redicted increment 
d, Ss shifted from partial 
reinforcement (32 trials) 
' à decrement in running 
Speed. It may be that while 32 trials are 
ident a acquire the r, response, con- 

} ore trials may be necessary 


to extinguish this emotionally based 
response, 


gation are not 
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University of North Carolina 


An experiment was conducted rej 
study by Slack in which size estimates of ofi- 
object had been found to regress to familiar 


rich environmental information, 


Two rather different approaches to 
the determinants of apparent size and 
distance may be distinguished. On 
the one hand there is the approach to 
be found in Gibson (1951) which 
Stresses the rich psychophysical in- 
formation usually present and mini- 
mizes the role of known, familiar 
object size; on the other hand there 
is the transactionalist approach (see 
e.g., Ittelson, 1962) which emphasizes 
the role of past experience in building 
up in O an object representation, so 
that the object's dimensions are 
known once it is recognized. Each 
approach has led to experimentation 
which shows that the sort of variables 
it considers may provide Sufficient 
conditions for size and distance per- 
ception. Thus, for example, Gibson 
(1951) has done work that indicates 
that the size of anonymous objects, 
such as stakes, may be accurately 
judged, even at great distances, if 
they are placed in the richly infor- 

1 The authors are greatly indebted to R. D. 
Bock and S. Zyzanski for aid in the analysis 
of the data on the UNC computer. 

* Visitor at the Center for Cognitive Stud- 
ies, Harvard University. . 
? Now at the University of Minnesota, 


plicating and extending an earlier 


size versions of a familiar 
size, under conditions of 


Size and distance estimates were 
obtained for normal, Oversized, and undersized 


stakes and abstract Constructions) located ona 
distances from S. There was no t 
judgments to be systematically bi: 
Slack’s findings were not replicated 
Position which argues that familiar s 
determinants of apparent size, under 
servation in a richly informative enviro: 


chairs (and control 
dirt road at various 


endency for either size or distance 
ased as a function of familiar size; 


These results do not support any 
ize is one of the major operative 
ordinary circumstances of ob- 
nment, 


mative environment of a plowed field. 
The transactionalists have shown that 
under conditions where all environ- 
mental information is eliminated and 
only object information remains (e.g., 
one-eyed viewing in a completely dark 
surround) O will rely upon his past 
experience with an Object in deter- 
mining its apparent size and distance 
But what happens 
f information are 
ere is both rich 


Statistically signifi- 
her small in absolute 


Characteristically, 
are viewed under Co; 
environmental 


familiar objects 
nditions of rich 
information so the 
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question of the determinants of per- 
ceived size under such circumstances 
is an important one. There is little 
direct evidence on the issue and there- 
fore this study was designed as a 
replication and extension of Slack's 
experiment. In addition to a number 
of procedural changes, such as com- 
plete randomization in the order of 
presentation of objects, this study 
extends Slack's experiment in two 
main respects: (a) Additional control 
objects were used. Slack used stakes 
as his control objects, but a stake 
differs from a chair not only in its lack 
of characteristic size but also in being 
visually less complex. A stake con- 
Sists of one vertical only; whereas a 
chair has a number of vertical and 
horizontal sides. In this study, in 
addition to stakes, abstract construc- 
tions with a number of verticals and 
horizontals were used as control ob- 
jects with the aim of more closely 
approximating chairs in regard to 
Visual complexity. (b) Distance esti- 
mates were obtained concurrent with 
the size judgments. This was done in 
Order to determine whether there was 
any systematic bias in the apparent 
distance of the off-size familiar ob- 
jects, and to permit an examination 
of the relation between apparent size 
and apparent distance in terms of the 
Size-distance invariance hypothesis 
(Kilpatrick & Ittelson, 1953). For 
*xample, if an undersized chair 1s seen 
as larger than a control object (of the 
same size at the same distance) will 
it also be seen as farther away? 


METHOD 


Sudjects—The Ss were met at 2 university 
uilding, were blindfolded, and driven to the 
Site of the experiment. There were 19 Ss, 
Volunteer male college students from the 
tMtroductory course in psychology- The data 
"om 3 additional Ss were discarded since 
heir distance estimates were quite erratic, 

US an object at 30 yd. might be estimated 
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Medium size abstract construction 
at site of experiment. 


Fic. 1. 


as closer than one at 20 yd. or farther away 
than one at 40 yd. 

iApparatus.—Three wooden chairs, three 
wooden stakes, and three abstract construc- 
tions constituted the experimental materials. 
The chairs were 563, 34, and 25 in. in height, 
and were proportional in length and width. 
As a result of some accidents in carpentry our 
objects were slightly smaller than those em- 
ployed by Slack which were 58, 343, and 25} 
in. in height, respectively. As in the Slack 
study, the chairs were armless and had two 
horizontal slats in the back. The stakes were 
2 X 4 in. pieces of wood equal in height to the 
chairs. The abstract constructions, also 
equal in height to the chairs, consisted 
of stakes with horizontal and vertical arms 
so arranged as to be similar to the chairs in 
visual complexity ie. in the number and 
sizes of the horizontals and  verticals. 
A photograph of one of the abstract 
constructions in situ is shown in Fig. 1. All 
objects were painted a dark brown and were 
stored some distance behind S between trials, 
S being firmly instructed never to look behind 
him. An adjustable cloth tape attached to 
the ground at one end (and painted brown on 
S's side) was used to obtain the size estimates. 

The experimental site was an unused, 
closed off dirt road several hundred yards in 
length, and approximately 20 ft. in width; 
there was a wide open field and a few bushes 
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on one side of the road and trees on a bank 
on the other side. ‘The trees were not dis- 
tinctly separate or of any specific and there- 
fore informative size; rather they blended 
into a quite homogeneous mass. The view 
was such that no artifacts such as telephone 
poles or houses were in sight during direct 
inspection of test objects. T he field of view 
afforded rich environmental information via 
many gradients of texture, there was the dirt 
of the road with pebbles and small rocks 
scattered irregularly, there was the grass, and 
there were the bushes and trees on the side 
of the road. 

Procedure.—The Ss viewed all objects 
facing in the same direction. Objects were 
presented at 20, 30, and 40 yd. from S. Both 
size and distance judgments were made on 
each object at a given distance before it was 
removed and another object presented. Half 
of the Ss made the size judgment for each 
object before the distance judgment, this 
order was reversed for the other Ss. The 
order in which the various objects were 
presented at the various distances was ran- 
domly determined for each pair of Ss, since 
two Ss were run in each session, great care 
being taken that no communication of any 
sort take place between Ss. While one S was 
giving his responses the other sat on the side, 
blindfolded. 

Size estimates were recorded as follows: E 
stood 3 ft. in front and 5 ft. to the right of S$ 
and moved the comparison tape up or down, 
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as directed by S by means of an upward or 
downward motion of the hand, until the tape 
appeared to S to have been extended to the 
same height as the object. This height was 
then recorded by E to the nearest half inch. 
In the case of distance estimates .S was asked 
to estimate how far away, to the nearest 
yard, the object appeared to be, and to record 
each estimate on a fresh page of a booklet 
with which he had been provided. 

The Ss were told to judge the height and 
distance of the objects from them. They were 
asked to make their judgments “in the same 
manner as you would judge the height and 
distance of things every day.” Essentially 
the instructions for height estimation were 
similar to those used by Slack, and instruc- 
tions for distance estimation were similar to 
those for height estimation with appropriate 
modifications. : 

The Ss were required to replace their 
blindfolds after making their judgments for 
each object, and remained blindfolded while 


one object was being replaced by another 
object. 


RESULTS AND Discussion 


The main findings are shown in 
Table 1 which gives the mean ap- 
parent height or size, and mean 
apparent distance from S, for each 


object. Analyses of variance were 
TABLE 1 
MEAN APPARENT SIZE AND MEAN APPARENT DISTANCE 
Distance in Yd. 
Object 20 30 40 
Siz Distance iz i iE x 
inf. | "yen |o fe | stance ints, | GM 
wc WENT. 15.37 J 
Chair . . 64.32 
Abstract 62.89 15.47 62.84 2410 n $5 
Stake 63.89 15.26 62.89 24.05 63.47 36.37 
Medium (34 in.) d 16.37 
Chair s . 42.84 2 4 5 
Abstract 59 | 1374 | azes | xa | HO | 3605 
Stake 43.32 17.05 42.63 24.91 43.53 36.32 
Small (25 in.) "m mm [ f K B 
Chair 3.3 è 34.21 26.58 6 
Abstract 33.84 15.63 35.21 25.63 Ed FERT 
Stake 3247 16.42 33.58 25.68 36.00 36.74 
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performed on the size and distance 
judgments and yielded the following 
results. 

Size judgments.—There was no sig- 
nificant main effect due to Distance, 
or due to Objects; there was a 
significant main effect due to Ss, 
F (18, 396) = 47.85, p < .01; and due 
to Size, F (2, 36) = 754.77, p < .01, 
with larger objects being judged 
larger. In addition, the following in- 
teractions were also significant —5s 
X Size, F (36, 396) = 17.20, P < .01; 
Ss X Distance, F (36, 396) = 2.63, 
b « .01; Size X Distance, F (4, 396) 
= 8.96, p < .01; and Size X Objects, 
F (4, 396) = 2.95, p < .05. With re- 
gard to the two significant interactions 
not involving Ss we note, for the Size 
X Distance interaction, that there 1s 
à tendency for small objects to be 
judged somewhat larger the farther 
away they are; there is no such tend- 
ency for the medium or large objects. 
This result is somewhat different from 
that obtained by Slack who found an 
overall significant tendency for objects 
to be judged somewhat larger the 
farther away they were. But the 
Critical interaction is that between 
Size and Objects, which indeed | is 
Significant (p < .05). An inspection 
of Table 1 reveals, however, that this 
interaction is significant 0! because 
Oversized chairs are underestimated 
and undersized ones overestimated 
relative to their control objects, 4 Was 
found by Slack, but rather because 
Oversized chairs are somewhat over- 
estimated relative to both of their 
Control objects, at each of the three 

istances, No consistent tendency . 
all is apparent when judgments © 
medium or undersized chairs are com- 
Pared with those of their control 
objects, 

_Distance estimate 
nificant main effects due 
F (2, 36) = 174.74, P <` 


s.—There were sig- 
to Distance, 
01; due to 


Size, F (2, 432) = 15.51, p < .01; due 
to Objects, F (2, 432) = 3.51, p < .01; 
and due to Ss, F (18, 432) = 143.39, 
p < .01. Only one of the interactions, 
that between Ss and Distance, was sig- 
nificant, F (36, 432) = 11.20, p < .01. 
The main effect of Distance indicates 
merely that estimates of apparent 
distance varied with actual distance. 
The main effect due to Size reflects 
principally the fact that large objects 
were seen as somewhat closer than 
medium or small objects at the same 
distance (a sort of distance constancy 
compromise effect). The Object effect 
is due to the fact that, on the average, 
the abstract construction is seen as 
somewhat closer than the chair, which 
in turn is seen as somewhat closer 
than the stake. 

An examination of Table 1 indicates 
that there is no consistent relation 
between mean apparent distance and 
mean apparent size for the trios of 
objects representing each of the nine 
Size X Distance combinations. The 
correlation between judged size and 
estimated distance was calculated 
separately for each of the 27 Size 
x Distance X Object combinations. 
The correlation coefficients obtained 
were moderate in value (median 
y = AT), and rather homogeneous. 
It should be noted that from the point 
of view of this study interest is not in 
the size-distance invariance hypoth- 
esis per se, indeed given the method 
used in this study where size and 
distance estimates are correlated for 
objects of given size at a given dis- 
tance the evidence is equivocal (see 
Gogel, Wist, & Harker, 1963). Rather 
we are interested in whether or not 
the size-distance relation is system- 
atically affected when an off-sized 
familiar object is involved. An in- 
spection of the correlation coefficients 
indicated that, in each case, the corre- 
lation between judged size and es- 
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timated distance for the undersized, 
normal, or oversized chair was very 
similar to that for the appropriate 
control objects presented at the same 
distance. 


It is clear that our results do not 
replicate those of Slack, that there is no 
tendency for size judgments of grossly 
off-size versions of familiar objects to 
regress to familiar or known size. There 
are indeed some differences in procedure 
and setting between this experiment and 
that of Slack. In particular, Slack per- 
formed his experiment in a large flat 
field whereas this experiment was con- 
ducted on an unused dirt road with a 
field on one side, and trees on a bank on 
the other side, the view being such that 
no artifacts such as telephone poles or 
houses were in sight during inspection of 
the test objects. It is hard to see how 
such differences in setting could account 
for the differences in results, and if the 
differences in results are attributable to 
differences in setting and procedure, then 
the generality and importance of Slack's 
findings, of which much is made by Ittel- 
son (1962) is certainly called into 
question. 

It should be pointed out that while 
Slack obtained statistically significant 
results, the effects were actually quite 
small in absolute terms, the difference in 
apparent height between chairs and 
stakes as a percentage of the total ap- 
parent height of the chairs varying from 
4% to 10%. Thus for the oversized 
chair, where the effect was the greatest, 
the average difference between it and its 
control stake was less than 4 in. whereas 
the average judged difference between 
the oversized chair and the medium 
chair was over 17 in. Thus the effect of 
information from the object's environ- 
ment is greatly preponderant over the 
effect of familiar size, in the Slack study. 

Slack's experiment and this extension 
of it represent attempts to set up a 

confrontation between two different 
classes of possible determinants of ap- 
parent size which usually yield mutually 
consistent information. Insofar as very 
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gross magnification or diminution was 
employed in these studies it may have 
had the effect of forcing S to recognize 
that very queer special sorts of objects 
were being presented for judgment, lead- 
ing him to make his size and distance 
estimates in terms of all the information 
that could be extracted from the environ- 
ment, with the result that there is no 
contamination of these estimates by 
familiar size in this experiment (and at 
the most a small, though statistically 
Significant, regression to known size in 
Slack's study). It might be of consider- 
able interest to repeat this sort of experi- 
ment with more moderate though still 
supraliminal over- and undersizing of 
objects, coming from classes homogene- 
ous as to size. If under such circum- 
stances one were still to obtain findings 
similar to those of this experiment, i.e., 
no regression to known familiar size, then 
the generality of the results obtained by 
the transactionalists under conditions of 
degraded information would be very 
much in doubt. In fact, even for condi- 
tions of degraded information the results 
of the transactionalists have been chal- 
lenged by the findings of a number of 
studies, see e.g., Hochberg and Hochberg 
(1952) and Epstein (1961); for a review 
of the relevant literature see Epstein, 
Park, and Casey (1961). 

Certainly our knowledge of objects 
built up through a history of commerce 
with them may be sufficient to specify 
their sizes, as indicated not only by the 
work of the transactionalists but also in 
studies such as that by Bolles and Bailey 
(1956) in which it was found that Ss' 
estimates of the sizes of familiar objects 
were no less accurate with eyes closed 
than under a subsequent condition of 
visual inspection. But such findings do 
not demonstrate that under usual circum- 
stances of visual exposure S actually 
uses visual information only to recognize 
or identity an object. The results of this 
experiment indicate that under condi- 
tions of rich information Ss can and do 
assess the sizes and distances of grossly 
enlarged or diminished familiar objects 
without bias by familiar size. 
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SPONTANEOUS RECOVERY OF LETTER-SEQUENCE HABITS ! 


ELI SALTZ 


Center for the Study of Cognitive Processes, Wayne State University 


Dominant letter-sequence habits were extinguished in 69 Ss and 
spontaneous recovery was measured 10 min., 1 hr., 2 hr., 3 hr., or 24 hr. 


after extinction. A significant (p < 01) d 


ery was measured across all Ss. 
extinction. 


The concept of spontaneous recov- 
ery has been used by theorists in the 
verbal-learning area (e.g., Gibson, 
1940; Underwood & Postman, 1960). 
However, very little direct work has 
been done to demonstrate the exist- 
tence and characteristics of spontane- 
ous recovery utilizing verbal ma- 
terials. Studies such as Briggs (1954) 
and Rothkopf (1957) have demon- 
strated that in a retroactive inhibition 
situation the A-B list recovers in 
strength as a function of time follow- 
ing A-C (or A’-C) acquisition. Such 
studies are only indirect evidence con- 
cerning spontaneous recovery since 
they are, strictly speaking, studies of 
the interaction over time between two 
competing S-R systems. No evidence 
appears to exist concerning the recov- 
ery of an extinguished response which 
has not been superseded by the 
strengthening of a competing verbal 
response to the same stimulus. 

The literature in the instrumental 
learning and the conditioning areas, 
in which the spontaneous recovery 
concept was originally developed, is 
inconclusive concerning the function 
relating recovery to time.  Ellson 
(1938) measured recovery between 5 
min. and 3 hr. after extinction of a 
bar-press response in the rat and found 

a continuously rising function. Lewis 
(1956) measured recovery between 1 
and 60 min. after extinction of a 


‘The present study was supported by 
National Science Foundation Grant G 21476. 


legree of spontaneous recov- 


Recovery was asymptotic by 1 hr. after 


running response in the rat and found 
that recovery had apparently not 
reached asymptote by 60 min. after ex- 
tinction. On the other hand, Graham 
and Gagné (1940) found that the 
asymptote of a recovery of a running 
response was reached after only 
several minutes. 

The function of the recovery curve 
poses a serious problem for writers 
like Underwood and Postman (1960) 
who suggest that at least some portion 
of forgetting is a product of interfer- 
ence arising from the recovery of 
responses extinguished during the 
learning of a new response. Should 
Tecovery of verbal responses reach 
asymptote shortly after extinction, 
only a limited portion of the forgetting 
curve could be accounted for by 
means of such a mechanism. 


Because of the theoretical impor- 
tance of the Underwood and Postman 
Position, the present study was de- 
signed in the context of one of the two 
models outlined by these writers, the 
letter-sequence interference model. 
This model states that previous ex- 
perience has developed letter-sequence 
habits among the various letters of the 
language. In learning a new response, 
certain of these sequences will be in- 
appropriate and must be extinguished. 
With time, these extinguished re- 
Sponses will recover in strength and 
compete with the newly learned 
Sequences, causing forgetting of the 
newly learned Sequences, 
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In the present study, the objective 
was to extinguish a dominant letter- 
sequence habit without reinforcing a 
competing letter sequence, and then 
to observe the recovery of the original 
sequence as a function of time. 


METHOD 


Subjecls.—The Ss were 15 students from 
introductory psychology at Wayne State 
University. These were assigned randomly 
to the five recovery intervals with the limita- 
tion that 15 Ss must be assigned to each 
interval. Analysis of the data revealed that, 
due to error by E, 6 Ss had not reached 
criterion of extinction before cessation of 
extinction trials, and these 5s were eliminated 
from the recovery analysis. 

Procedure.—The Ss were presented the 
following list of 20 letter combinations: VA, 
CO, KA, LO, HI, GA, FU, BE, Cl, HO, FI, GO, FA, 
DI, JA, LA, DO, JE, DE, and RA in three different 
Orders on successive trials. Instructions were 
to give the first letter that came to mind on 
Seeing each letter pair on the memory drum. 
For the first five trials, E made no response. 
From Trial 6 on, E systematically said, 
"Wrong" to any response made to a, pre- 
Selected five items. These were the critical 
Š items for which spontaneous recovery was 
to be measured. A different 5 items were 
randomly selected for each 5. To reduce the 
likelihood of voluntary suppression of the 
dominant response to the 5 critical items, E 
attempted to obscure the situation for S by 
randomly selecting an additional 5 items on 
cach trial and saying, “Wrong” to these also. 

O the responses to the remaining 10 items, 
E said, "Right." Thus S was systematically 
told his response was wrong for 5 items, and 
Sometimes told he was right, sometimes 
Wrong, for each of the remaining 15 items. 

he first five trials, during which E VR 
S no feedback, were used to determine t T 
dominant response to each of the five critica 
items. A response was considered dominant 


if it occurred on at least three a ea dep 
e extincti ies stopped when 9 fat e 
E nction series stopP! to all of the five 


Ive his dominant response " 
critical iras o ae hccessive trials. Items 
Were presented on a Lafayette drum at a 
“Sec. rate. Very few omissions occurred. 

he Ss were tested for spontaneous a 
covery 10 min. 1 hr, 2 hr, 3 hr, or 24 hr 
after extinction. Instructions were the same 
àS during the previous extinction series: Give 
NU first letter that comes to mind upon 
Seeing the letter pair in the drum. 


RESULTS 


Extinction.—The extinction rates 
were extremely variable between Ss. 
Number of trials to extinction (omit- 
ting the first five base-line trials) 
ranged from 4 to 43, with a mean of 
18.52 and an SD of 10.84. An esti- 
mate of the number of emissions of 
the dominant response to a critical 
item was obtained by averaging the 
number of such emissions (again 
omitting the five base-line trials) over 
the five critical items for each S. 
These scores ranged from 1.2 to 23.4, 
with a mean of 7.6 and an SD of 5.88. 
Obviously, some Ss were extremelv 
tenacious in holding onto the domi- 
nant responses while others gave up 
the responses readily. These re- 
sponses in almost all cases completed 
the initial two letters so as to con- 
stitute a familiar three-letter word. 

Spontaneous recovery.—lwo base 
lines are necessary for evaluation of 
spontaneous recovery. The first is 
degree of dominance of the critical 
responses prior to the onset of extinc- 
tion. This was readily obtained from 
the first five trials. Each S's domi- 
nant response was determined from 
each item; the number of times each 
dominant response occurred during 
the five trials was calculated and 
divided by 5 to give a mean number of 
dominant responses per trial. 

The second base line required is an 
estimate of response strength follow- 
ing extinction. During extinction, Ss 
were run to a criterion of two trials in 
which none of the dominant responses 
was emitted to the criterion items. 
Performance criteria, however, are 
known to overestimate the relevant 
effect on the criterion trials. Conse- 
quently, an assumption of complete 
extinction on the criterion trials is 
likely to give a spuriously high in- 
dication of spontaneous recovery. A 
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TABLE 1 
MEAN RESPONSES FOR PREEXTINCTION LEVEL, EXTINCTION LEVEL, 
AND SPONTANEOUS RECOVERY ABOVE EXTINCTION LEVEL 
Net Recovery 
Preextinction Eagnetion 
10 Min. 1 Hr. 2 Hr. 3 Hr. 24 Hr. 
4.40 1.97 0.00 0.92 1.14 -66 1.00 
b 0.31 1.31 2.60 1.84 1.81 1.59 1.29 
N 69 69 14 12 15 15 13 


more satisfactory base line for extinc- 
tion was calculated from the following 
reasoning : Call the final criterion trial 
X and the first criterion trial X — i 
it follows that at least one of the 
critical responses was emitted on 
Trial X — 2, or X — 1 would have 
been the final criterion trial. How- 
ever, no such mathematical constraint 
applies to Trial X — 3, Consequently, 
performance on Trial X — 3 was used 
as an estimate of response strength at 
extinction. It is recognized that per- 
formance on this trial is probably an 
overestimate of such Strength since 
three additional extinction trials fol- 
low; however, it was felt that an 
overestimate would be a more con- 
servative measure since it would make 
for greater difficulty in demonstrating 
that either extinction or Spontaneous 
recovery had occurred. 

Table 1 summarizes the data rele- 
vant to extinction and recovery, As 
can be seen in Table 1, there was little 
tendency to give any but the domi- 
nant response during the preextinc- 
tion period. This tendency dropped 
sharply by Trial X —3. A simple 
sign test shows that 90% of the 69 Ss 
dropped in tendency to give the 
dominant response, z — 6.51, P «.01. 
Similarly, over all 69 Ss, 69% show a 
recovery of the dominant responses 
following a cessation of extinction, 
z = 3.01, p «.01. Clearly, then, 
using a conservative estimate of ex- 
tinction, both extinction and spon- 


taneous recovery can be demonstrated 
in the present type of situation. 

Net recovery, indicated in Table A 
is the extent to which tendency to 
give the dominant response has in- 
creased over the X — 3 estimate of 
extinction. At 10 min. after extinc- 
tion there is no indication of spon- 
taneous recovery. Spontaneous re- 
covery appears at the 1-hr. test and 
does not increase between this point 
and 24 hr, However, analysis of 
variance over the five recovery periods 
yields an insignificant F of approxi- 
mately 1.00. 

Finally, it is interesting to note that 
neither number of overt responses 
Prior to extinction, nor level of 
response at the extinction base line 
(X — 3) is related to net spontaneous 
recovery in any of the groups. 


Discussiox 


The present study clearly indicates 
that extinction and Spontaneous recovery 
of letter-sequence habits occurs. Re- 
COVery is relatively sizable when one 
considers that the extinction base line is 
-7/ Items and the preextinction response 
level is 4.40 items, Producing a maximum 
Possible recovery range of 2.43 items. 
Compared to this maximum, the ob- 
tained recovery at 1 hr. is 3895; combin- 
ing the 1-, 2./ 3. and 24-hr. conditions, 
the recovery is 3095. As has been 
already Suggested, these are probably 
underestimates of recovery since the 


X-—3 base line underestimates the 
degree of extinction, 
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A conservative conclusion from the 
present data is that spontaneous recovery 
of letter-sequence responses is close to 
its obtained maximum at 1 hr. with the 
curve between 1 and 24 hr. being rela- 
tively flat. Since the material used in the 
present study elicits dominant responses 
that produce familiar three-letter words, 
test periods of longer than 24 hr. run the 
danger that Ss may practice the domi- 
nant habits during their normal extra- 
experimental activities; measured recov- 
ery might then be spuriously high indexes 
of spontaneous recovery. 

The obtained data of the present study 
suggest that no recovery has occurred by 

0 min. after learning. Since the actual 
amount of extinction. produced in this 
study is probably somewhat lower than 
that measured by the X — 3 base line 
employed, it appears reasonable to guess 
that some recovery has occurred in the 
first 10 min. Atany rate, it appears that 
the critical interval for future examina- 
tion is within the first hour after ex- 
tinction, 

The results of the present study sup- 
Dort the suggestion by Underwood and 
Postman (1960) that letter-sequence 
habits are subject to extinction an 
Spontaneous recovery paralleling that ob- 
tained in instrumental learning experi 
Ments, Peak and Deese (1937) have 
Previously demonstrated extinction an 
Spontaneous recovery of à verbal CVC 
response in paired associates using à 
Procedure that parallels the classical con- 

itioning experiment. The question 
remains, given that these processes eceur, 
What is the extent of their role in acquisi- 
tion and forgetting of verbal materials? 

ome caution must be exhibited in 
relating the present results to the eh 
Wood and Postman (1960) theory © 
forgetting. The present procedures ap- 
Pear to have resulted in @ much more 
tenacious tendency for Ss to overtly emit 
the responses being extinguished than is 
Usually found in verbal-learning studies 

i.e., in the typical verbal-learning muy 
almost no intrusions from outside the list 
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occur). While the data here reported 
indicate no relationship between number 
of responses prior to extinction and net 
amount of recovery, the issue cannot be 
considered closed; it is possible that 
asymptote of spontaneous recovery might 
be reached later if extinction were more 
rapid. However, with this caution in 
mind, the data in this experiment suggest 
that decrements which occur in retention 
of newly learned responses between 1 and 
24 hr. after learning cannot be attributed 
to increments in spontaneous recovery 
of the original dominant letter-sequence 
habits, since the spontaneous recovery 
curve is asymptotic over this interval. 
Spontaneous recovery could, of course, 
be relevant to forgetting which occurs 
within the first hour after new learning. 
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ASSOCIATIVE AND DIFFERENTIATION VARIABLES 


IN ALL-OR-NONE LEARNING ! 


CLESSEN J. MARTIN? 


Center for the Study of Cognitive Processes, Wayne State University 


An assessment of the stimulus-differentiation variable within the 1-trial 
learning controversy was made. All 20 Ss in each group learned the 
same final 12 paired associates to 1 perfect trial. The C group was 
presented the same 12 items on all trials. The E-1 group had a different 
pairing of the 3 S-R terms during the 1st 2 trials from that of the remain- 
ing trials. The E-2 group, during the 1st 2 trials, was presented 3 
stimuli which were then replaced by the 3 stimuli learned to criterion 


by the other 2 groups. 


Only data from the 20 Ss in each group who 


failed to respond correctly to any of the 3 critical items during the 1st 


2 test trials were analyzed. 


The results showed that significantly 


fewer number of trials were required to learn the 3 critical items by the 


C group as compared to the E-1 group. 
against the 1-trial learning position. 


"This finding provides evidence 
The importance of the stimulus- 


differentiation variable was shown by the fact that a significantly fewer 
number of trials were required tolearn the 3 critical items by the E-1 
group in comparison with the E-2 group. 


This study had two major objec- 
tives. The first objective was to 
provide an alternative procedure to 
the Rock-type design (Rock, 1957) 
for the investigation of one-trial 
learning. The proposed advantage of 
the present procedure is that the final 
list upon which the experimental and 
control groups reached criterion was 
the same. Therefore, the matter of 
item difficulty became less important 
in a comparison of the performance of 
the experimental group vs. the control 
group. The second objective was to 
assess the relevancy of the stimulus- 
differentiation variable within the 
one-trial learning controversy. 


1 This article is based upon a dissertation 
submitted to Wayne State University in 
partial fulfillment of the requirements for the 
degree of Doctor of Philosophy. The author 
gratefully acknowledges the assistance given 
by Eli Saltz. The investigation was sup- 
ported in part by a research grant, G-21476, 
granted to Eli Saltz from the National Science 
Foundation. A report of these results was 
presented at the 1963 American Psychological 
Association meeting. 

? Now at Michigan State University. 


The general procedure in this study 
consisted of presenting the control 
group (C) with the same 12 items on 
all trials. Experimental Group 1 
(E-1) had a different pairing of the 
three critical S-R terms during the 
first two trials from that of the remain- 
ing trials. Experimental Group 2 
(E-2), during the first two trials, was 
presented three stimuli which were 
then replaced by the three stimuli 
learned to criterion by the other two 
groups. 

_ The validity of the one-trial posi- 
tion was evaluated by comparing the 
C and E-1 groups with respect to the 
ease of learning the three critical 
items. "The one-trial position was 
interpreted as predicting no difference 

etween the C and E-1 groups. I! 
stimulus differentiation is considered 
tobea relevant factor, the incremental 
and one-trial Positions would predict 
the E-1 group to learn the three 
critical items sooner than the E-2 
group. 

Table 1 presents a summary of the 
fferential predictions made by the 


di 
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one-trial and incremental positions 
with respect to the relevancy of the 
stimulus-differentiation variable. 


METHOD 


Subjects. —A total of 75 undergraduate 
students enrolled in introductory psychology 
at Wayne State University was tested in this 
investigation. Only those Ss who failed to 
respond correctly to any of the three critical 
pairs during the first two test trials were used 
in the analyses. 3 

Materials.—Three 12-item paired-associate 
lists were constructed. Each of the three 
lists had the same nine pairs. The stimuli for 
the nine pairs consisted of high-meaningful 
(m) disyllabic nouns as scaled by Noble 
(1952). ‘The responses were single letters and 
were the same for all three lists. Three 
stimuli in each list consisted of low m value 
disyllabic nouns paired with single letters. 
All groups reached criterion on List 2 which 
consisted of the following pairs: WAGON-S, 
OFFICE-Y, VILLAGE-K, HEAVEN-F, INSECT-N, 
SAGROLE-Z, KITCHEN-M, BALAP-J, DINNER-H, 
JEWEL-D, NEGLAN-Q, GARMENT-X. The crit- 
ical items in List 2 are italicized. The three 
critical items in List 1 were: NOSTAW-Z, 
ZUMAP-J, and MEARDON-O. The three crit- 
ical items in List 3 were: SAGROLE-J, BA- 
LAP-Q, and NEGLAN-Z. 

Procedure.—The S-R pairs were presented 
ata 1-sec. rate during the learning trials on a 
Stowe memory drum. After each learning 
trial a test trial was given. Each stimulus 
Was typed in capital letters on a 3 x5 i 
card and during the test trials the stimu’ 
Were presented one at a time and S hiad un 
limited time in which to respond. a 
learning and test trials were alternated unti 
Ss reached a criterion of one errorless is 
trial. The S-R pairs were presented in three 
orders on the learning trials and the S ere 
Were presented in three different orders on the 
test trials, 

Three groups o 
group, Experimental 1 gr 
Mental 2 group. : 

_ Control group (c) 1 h 
List 2 throughout all trials. This group 

Experimental 1 grot E-1).— Wee e 
Was presented List 3 during the first vith 
learning and test trials. Begin PARE 

-earning Trial 3, List 2 was presente! or i 
remaining trials. The only difference 1 
tween the two lists consisted of a di es 
Pairing of the stimuli and response? among the 
three Jow-m pairs, 


ed: Control 


were test r 
fs nd Experi- 


oup, & 


is group learned 
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TABLE 1 
DirFERENTIAL PREDICTIONS MADE BY THE 
ONE-TRIAL AND INCREMENTAL POSITIONS 

WITH RESPECT TO THE EASE OF LEARNING 
or THE THREE CRITICAL S-R ITEMS 
AS A FUNCTION OF THE RELE- 
VANCY OF THE STIMULUS 
DIFFERENTIATION 
VARIABLE 


Theoretical Acquisition of an 
S-R Association 
Stimulus 
Differentiation 


One-Trial Incremental 


Relevant 
Not relevant 


Experimental 2 group (E-2).—This group 
was presented List 1 during the first two 
learning and test trials. On Learning Trial 3 
List 2 was presented for the remaining trials. 
For this group, new low-m stimuli were in- 
serted in place of.the three low-m stimuli 
presented during the first two learning and 
test trials. 

The Ss were assigned alternately to the C 
and E-2 groups as they appeared at their 
scheduled time of testing. A total of 20 Ss 
who failed to respond correctly to any of the 
critical items during the first two test trials 
was tested in each of the two conditions. 
Subsequent testing involved testing 20 more 
such Ss in the E-1 group. 


RESULTS AND DISCUSSION 


The results were based upon the 20 
Ss in each of the three groups who 
failed to respond correctly to any of 
the three critical items during the 
first two test trials. Five Ss in each 
of the three groups made one or more 
correct responses to the three critical 
S-R pairs on the first two test trials. 

In order to determine whether the 
three groups were comparable in 
learning ability prior to the introduc- 
tion of the experimental manipulation, 
the Kruskal-Wallis was used to anal- 
yze the total number of correct re- 
sponses made on the first two test 
trials. The use of the Kruskal-Wallis 
was necessary because of the differ- 
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TABLE 2 


MEDIAN AND SEMIINTERQUARTILE RANGE OF 
i THE NUMBER OF TRIALS REQUIRED 
TO MAKE THE FIRST CORRECT 
RESPONSE TO THE THREE 
CRITICAL ITEMS 


| Groups 


€ E-1 | E-2 
Mdn. 13.00 | 17.00 | 23.00 
Q 2.25 4.38 8.92 


ences in the form of the three distribu- 
tions and also because of heterogeneity 
of variance. The Kruskal-Wallis 
ranked analysis of variance showed 
that the three groups were not signifi- 
cantly different from each other at the 
-05 level (x? = 3.55). 
he analysis of the rate of learning 
the three critical items was crucial to 
an interpretation of the data in this 
investigation. The response measure 
for each S consisted of the total 
number of trials required to make the 
first correct response to all three 
critical pairs. The median and semi- 
interquartile range of the number of 
trials required to make the first correct 
response to all three item 
sented in Table 2. 

In order to determine which of the 


S are pre- 
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ent, the Mann-Whitney U test was 
used. When the C group was com- 
pared with the E-1 group, the result- 
ing U value of 115.5 was significant at 
the .025 level for a one-tailed test. 
This comparison provides negative 
evidence for one-trial learning. It 
indicates that when the same three 
low-m stimuli and responses are re- 
tained after Test Trial 2 but are re- 
paired, there is a significant decrement 
in the learning of these three S-R 
pairs as compared to the learning of 
the three pairs in the C group. 
Regardless of whether one assumes 
the one-trial or incremental position, 
the E-1 and E-2 comparison provides 
an evaluation of the relevancy of the 
stimulus-differentiation variable. The 
resulting U value of 128.5 was signifi- 
cant at the .05 level and indicates a 
slower learning rate among the three 
inserted items in the E-2 group as 
compared to the three re-paired items 
in the E-1 group. 
The only position in Table 1 which 
does not predict a difference between 
the C and E-2 groups is the one-trial 
position that considers differentiation 
to be irrelevant in the learning of an 
S-R association. This comparison 


yielded a U value of 76 which was 
significant at the ,0 


, X $ 02 level and indi- 
comparisons were: significantly differ- cates a faster learning rate for the 
TABLE 3 


Predicted Theoretical Relationships 


ai Level of 
Reationships Significance Incremental One-Trial 
|) SE Not Rely, Diff. Rely Not Relv. 
CO < El 025 CE | Gu Eel 
C < E2 002 CZEA2 Je ese 
E-1 < E-2 05 E-1 < E-2 E-1 = Ẹ E-l < EQ 
Note.—Diff. = Differentiation, Relv. = Relevant. 


ASSOCIATION AND DIFFERENTIATION 


three critical items in the C group as 
compared to the E-2 group. 

Table 3 presents a summary of the 
obtained relationships among the 
three groups on the number of trials 
required to respond correctly to the 
three critical items and the predictions 
made by the four theoretical positions 
presented in Table 1. The C and E-1 
comparison provides positive evidence 
for an incremental development of 
associative strength between the three 
low-m pairs. Comparing the E-1 
group with the E-2 group demon- 
strated the relevancy of stimulus 
differentiation in the formation of an 
S-R association. The comparison 
between the C group and the E-2 
group demonstrated the combined 
effect of the subthreshold develop- 
ment of associative strength between 
the three low-m pairs and the lack of 
stimulus differentiation among the 
three low-m stimuli. E 

The total trials to reach criterion 
for the 12-item list were analyzed by 
the use of al X 3 analysis of variance 
design. Table 4 presents the means 
and standard deviations of the total 
number of trials to reach criterion for 
the three groups. The. F (2, 57) 
= 1.66, did not approach significance 


at the .05 level. 
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TABLE 4 


MEANS AND SDs OF THE TRIALS TO CRITER- 
ION FOR THE C, E-1, AND E-2 Groups 


Groups 
[e E-1 E-2 
M 11.55 15.00 15.85 
SD 7.84 8.10 7.74 


Thus, in spite of the fact that the 
three low-m pairs required increas- 
ingly more trials in the C, E-1, and 
E-2 groups before a correct response 
was given, there was not a significant 
difference in the total number of trials 
to reach criterion for the three groups. 

The results have been interpreted 
as supporting an incremental develop- 
ment of associative strength. They 
also demonstrated the importance of 
the stimulus-differentiation variable 
in learning verbal associations. 


REFERENCES 


Noste, C. E. An analysis of meaning. 
Psychol. Rev., 1952, 59, 421-430. 

Rock, I. The role of repetition in associative 
learning. Amer. J. Psychol, 1957, 70, 
186-193. 


(Received January 28, 1964) 


Journal of 


Experimental Psychology 
1965, Vol. 


69, No. 3, 312-316 


EVENT SALIENCE AND RESPO 
TEN-ALTERNATIVE PROB 


NSE FREQUENCY IN A 
ABILITY-LEARNING 


SITUATION : 


LEE RO 


Y BEACH 


Office of Naval Research 
AND RICHARD W, SHOENBERGER 


Behavioral Sciences Laboratory, 


In the two 
ing situation 
that the rela 
responses clo 


"event probability-learn- 
it is commonly found 
tive frequencies of Ss’ 
sely match the events’ 
relative frequencies of occurrence, In 
multiple-event situations, however, it 
is found that as training progresses Ss’ 
response frequencies “overshoot” the 
event frequencies for the more fre- 
quently occurring events and “under- 
shoot" them for infrequently Occur- 
ring events (Cotton & Rechtschaffen, 
1958; Gardner, 1957 1958; Mc- 
Cormack, 1959), 

The most apparent vari 
can be suggested as a possi 
overshooting in the mu 
situation is the number of 
events. Indeed, in two stu 
and Rechtschaffen (1 
ner (1957) found the 
shooting was greater fi 
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ble cause of 
Itiple-event 
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ts is necessarily the 
ofovershooting. Per- 
to which the relative 
high-frequency event 
tive frequency of the 
of equal or greater im- 
Portance, That is, perhaps the more 

Salient an event is, due to its excep- 

tionally high rate of occurrence, the 

more Ss’ responses its 
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frequency of the 
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overshoot 


EVENT SALIENCE AND PROBABILITY LEARNING 


objective relative frequency of occur- 
rence. Overshooting the high-fre- 
quency events would, of course, result 
in undershooting the lower frequency 
events. 

While a more complex definition 
may be necessary for future investiga- 
tions, in the present study salience 
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was defined as the “sharpness” of the 
peak in the distribution caused by an 
event occurring with a higher relative 
frequency than the events which sur- 
round it on some underlying physical 
or conceptual dimension. This defini- 
tion permits the computation of a 
simple Salience Index value for the 
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TABLE 1 


RELATIVE FREQUENCIES OF EVENTS (E) AND oF RESPONSES (R) 
OVER Last 150 TRIALS ror EACH Group or Ss 


Group 


Event 


o 1 2 3 


01 | .06 | .14 | 25a 
00 | 02 | ‘16 | 36 


.02 05 12 £ 
-02 03 08 ES 


92 | 05 | 10 |. 
00 | 02 | 06 |* 754 


01 -03 -06 -10 
'.00 -01 02 -03 


-282 4 .06 01 


-03 -06 135 10 
-01 .02 .08 .05 


E 


278 12 05 02 


a 
e| mm mj am| am| xu 


05 10 s 10 
-01 .02 .09 05 


299 -10 05 .02 


^ High-salience events, 


high-frequency events by the equation 
Sle, = 2Pr(E,) —Pr (En-1) — Pr(Eny1) 


where Pr(E,) is the relative frequency 
of the high-frequency event and 
Pr(E,;) and Pr(E,,1) are the relative 
frequencies of the events on either 
side of E, on the underlying dimen- 
Sion. A high positive value of SI 
indicates a high degree of salience for 
Event En. It was Predicted that Ss 
tendency to overshoot for Event By 
would be positively related to its 
degree of salience, 


METHOD 


The apparatus consisted of a 34X8 in, 
panel on which was mounted a row of 10 
white event lights numbered from 0 to 9. 
Above the row of white lights was mounted an 
amber “ready light.” The Ss were Provided 
with IBM Port-a-Punch boards and were in- 
structed that when the ready light came on 
they were to punch into their IBM card the 
number of the event light they expected to 
come on next. (After having punched their 
prediction and checked its accuracy Ss 
covered it with another card; this prevented 


pause, 2 sec, ready light again, etc. s 
were run for approximately 1 hr, each day 
until they reached à total of 1,500 trj 

Stimulus 


g Command, Pensa- 
cola, Florida, While this was not a random 
Procedure it was a close approximation. Each 
experimenta] group consisted of one class (the 
Classes had no contact with one another) and 
there Were six groups: 10 Ss in Group I, 12 
Ss in Group II, 14 Ss in Group III, 15 Ss in 
Group IV, 13 Ss in Group V, and 13 Ss in 
Group VI. 


, Each group of Ss was presented with a 
different 


Three 


Present a progression 


in salience for Events 3 and 7, and three 
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represent a progression in salience for Events 
2and6, For Group I the salience index is .17 
for Event 3 and —.01 for Event 7; for Group 
II SI; 2.29 and SI; = .10; for Group III 
SI; = 41 and SI; = .14; for Group IV SI; 
= —.01 and SI, = .23; for Group V Sls 
7.10 and SI; 29; and for Group VI 
Sl: = .14 and SI, = 41. 


(50) 1 2 3 4 5 
Group III 


RELATIVE FREQUENCY 


00 2 3 
Group I 


curves for th 


Fic. 2. Response m 
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RESULTS 


On the graphs of the stimulus-fre- 
quency distributions in Fig. 1 are 
superimposed the mean proportion of 
Ss giving each response averaged over 
the last 150 trials. These results, also 
given in Table 1, only partially sup- 


oO 
(50) 1 2 3 4 5 
Group VI 


TRIAL BLOCKS 


wo high-salience events. Y P 
aes baed on blocks of 150 trials. The numbers on the 


(The first point on the curve 


15 based on 50 trials. All othe E the curve is related.) 


Tight indicate the event to whic 
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port the hypothesis that overshooting 
is related to event salience. The 
increase in salience from Group I to 
Group III for Event 3 and from 
Group IV to Group VI for Event 6, 
is reflected in greater overshooting. 
(Spearman rank-order correlation be- 
tween the difference between Pr(Z£;) 
and the observed response relative 
frequency and SI for the high-salience 
event is .89, p = .05, n = 6.) How- 
ever, similar increase in salience for 
Events 7 and 2, while reflected in high 
response frequencies, do not show a 
similar pattern of overshooting. 

The graphs in Fig. 1 reveal that 
overshooting on the  high-salience 
event has exercised a nonsymmetrical 
influence on the response frequencies 
for the other events. Events near the 
end of the 0-9 scale appear to be 
undershot to a greater degree than 
those in the middle of the scale. 
Moreover, events which are adjacent 
to the high-frequency event and which 
lie toward the middle of the scale, were 
either overshot or less undershot than 
the adjacent events on the other side 
of the high-frequency event. Clearly, 
these results reflect generalization 
effects. However the data are not 
sufficient to reveal the nature of these 
effects. 

Research by Jenkins and Cunning- 
ham (1949) has shown Ss to have 
rather strong number preferences 
when asked to produce free-response 
lists of numbers from 1 through 10; 
the numbers 3 and 7 were the most 
preferred. Such preferences probably 
had some influence on responses in the 
present experiment. However, if it is 
assumed that all groups had roughly 
equivalent number preferences, the 
systematic differences in overshooting 
from group to group as a function of 
the salience of the high-frequency 


event (Fig. 1) rules out the possibility 
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that the findings could be accounted 
for in terms of number preferences. 
In addition, the fact that salience 
produces overshooting for Events 2 
and 6 as well as for the preferred 
Events 3 and 7 further reduces the 
possibility that number preferences 
cause overshooting. 

The results in Fig. 1 are also re- 
flected in the response curves in Fig. 2. 
These curves indicate that there was 
a rapid and fairly consistent increase 
in responses to the high-salience 
events throughout training, an in- 
crease which had not leveled off after 
1,500 trials. The slopes of these 
curves generally correspond to the 
rank orders of the high-salience events 
for the various groups. On the other 
hand, the curves for the second most 
salient events leveled off early in train- 
ing and subsequently even decreased 
for four of the six groups. 

In short, the results indicate that 
an event's salience is not sufficient to 
promote overshooting. However, for 
the most highly salient event the 
degree to which overshooting occurs 
is positively related to the degree of 
salience. 
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DIFFERENTIAL EFFECTS OF STIMULUS AND RESPONSE 
ISOLATION IN PAIRED-ASSOCIATE LEARNING ! 
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Either the stimulus or the response of 1 S-R pair in an 8-pair list of 2- 


letter consonants was isolated by 


printing it in red. Lists were pre- 


sented either in the usual paired-associate manner or in a manner such 
that the stimuli and responses were never simultaneously exposed. 
Both stimulus isolation and response isolation produced an effect 
(p < .001), but the effect of stimulus isolation was greater (p < .001). 
Manner of presentation produced no detectable effect. Results were 


interpreted to be inconsistent with 
Schulz's (1959) assumption that 
sponse similarity will produce equ 
phase of learning, 
von Restorlf effect 
generalization. 


While the more rapid learning of 
unique items in lists of verbal material 
seems well established empirically, 
there is no generally accepted theo- 
retical explanation for this isolation, 
or von Restorff, effect (Erickson, 
1963). A prerequisite to the develop- 
ment of such an explanation 1s the 
acquisition of additional data relevant 
to the properties of the isolation effect. 
l'he purpose of the present study was 
to gather such data by investigating 
more precisely the relative efficacy of 
isolating a stimulus as opposed to à 
response element in à paired-associate 
list. Such data are also relevant to 
theories of verbal learning in general, 
particularly those stressing the role of 
intralist associative interference. 

Studies of differential stimulus-re- 
sponse isolation effects in paired- 
associate learning are rare» and the 


data available are of questionable 
; Grant 
! Thi as supported by Grant 
MH ent can the National Institutes 
of Health. Pilot work on the study was Mi 
Possible through Grant G-12986 from th 
National Science Foundation. 
The author wishes to thank Mn Sre 
and Leon Swartzendruber for the! 
ata collection. 


or (b) an S-R int 
that is based solely upon reduced stimulus or response 


either (a) Underwood, Runquist, and 
equal increases in stimulus and re- 


al interference effects in the associative 


ralist interference explanation for the 


value. Kimble and Dufort (1955), in 
their Exp. Il, inferred that their 
results supported the implication that 
the isolation effect is a perceptual one 
that operates chiefly on stimuli, since 
they felt that they had demonstrated 
an isolation effect under a condition 
of stimulus isolation and an "inverse" 
isolation effect under that of response 
isolation. However, their design con- 
founded the conditions of stimulus 
and response isolation with the kinds 
of material functioning as stimuli and 
responses. In addition, their infer- 
ence was based upon a misinterpreta- 
tion of the meaning of a significant P 
for interaction. Their significant F 
for interaction can only be legiti- 
mately interpreted to indicate that 
the relative ease of learning isolated 
and nonisolated pairs was different 
under the conditions of stimulus and 
response isolation. In order to be 
valid, Kimble and Dufort's inter- 
pretation that the isolated pairs were 
learned more easily than the non- 
isolated pairs when isolation was on 
the stimulus side would require a 
significant F for the simple main effect 
of isolation on the stimulus side. A 
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test of this effect (Winer, 1962, p. 
311), using the published data, yielded 
a nonsignificant result, / (1, 26) 
= 1.24, p > 25. 

Nachmias, Gleitman, and McKenna 
(1961) attempted a more rigorous in- 
vestigation of differential stimulus- 
response isolation effects, from which 
they concluded that isolation is effec- 
tive on both the stimulus and the 
response sides of paired-associate lists, 
and “to about the same extent." 
However, the fact that they isolated 
either two stimulus or two response 
elements in each of their nine-pair 
lists seems unfortunate. Pillsbury 
and Raush (1943) have demonstrated 
a regular decrease in the advantage of 
isolated items as the ratio of disparate 
material to massed material is in- 
creased in serial lists. By using two 
isolated pairs in each list rather than 
one, Nachmias, Gleitman, and Mc- 
Kenna very likely attenuated the 
isolation effect, and may have thereby 
reduced the probability of detecting 
any real difference existing between 
the effects of stimulus and response 
isolation. 

Erickson's (1963) subsidiary finding 
that isolation of stimuli and responses 


TABLE 1 


Basic ITEMS USED IN ALL 32 
EXPERIMENTAL Lists 


Item Position 1 Item Position 2 


RV-MB* MB-RV« 
TL-CN CN-TL 
GK-SD* SD-GK* 
BJ-HG HG-BJ 
LF-VM* VM-LF* 
DH-FP FP-DH 
NR-PC* PC-NR* 
JT-KS KS-JT 


Jote.—Lists were presented in eight different orders 
D that each pair appeared once at each of the eight 
Sucitions and there was never more than one repetition 
pes adjacent pairs in running through eight con- 
rative trials of a list. The eight orders were the same 
for all lists. 
^ [solated pairs. 
given list. 


Only one pair was isolated in any 
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did not produce detectably different 
effects suffers from a somewhat similar 
limitation. In the relevant part of 
this study, isolation of each critical 
S-R pair was produced by (a) the 
unique relationship between the items 
constituting the critical pair, and (b) 
differential stimulus-response isola- 
tion along the dimension of color. 
Since any differential effects produced 
by stimulus vs. response isolation 
were in addition to the undifferen- 
tiated isolation effect produced by the 
unique relationship between the items 
of the critical pair, the fact that the 
data suggested a larger isolation effect 
under stimulus isolation than under 
response isolation demands further 
investigation. 

The major purpose of the present 
study, then, was to further delineate 
the differential effects of stimulus and 
response isolation, using paired-asso- 
ciate lists in which the stimuli and the 
responses were composed of the same 
type of material and represented com- 
parable levels of intralist similarity- 
Differential stimulus-response isola- 
tion was produced by printing either 
the stimulus or the response of à 
critical pair in red, while all other 
items appeared in black. The lists 
were otherwise identical under the 
two main experimental conditions. 

An additional purpose was to de- 
termine the effect of altering the usual 
paired-associate procedure by pre- 
senting lists in such a manner that the 
stimuli and responses were expose 
for equal amounts of time and were 
never simultaneously visible to S. 


METHOD 


Materials and apparatus.—Table 1 shows 
the eight pairs of basic items that were use 
in all lists, Item Position 2 being a mirror 
rage of Item Position 1. Four pairs func- 
ies "p isolated pairs, only one pair being 
th ed in any given list. List 1 consisted of 

e eight pairs shown in Item Position b 
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with the stimulus item of the critical pair 
RV-MB printed in red. All nonisolated items 
appeared in black only. Lists 2-4 were 
identical to List 1, except that different 
critical stimuli (Gk, LF, and NR) were isolated 
by being printed in red. Lists 5-8 replicated 
the first four, except that the response items 
were isolated along the color dimension in the 
above fashion. 

Lists 9-16 replicated the first eight, except 

that the critical element of the isolated pairs 
was not only printed in red, but also ap- 
peared in a larger and different style type. 
Lists 17-32 were identical to Lists 1-16, 
except that they were their mirror images, 
consisting of the particular S-R pairs shown 
in Item Position 2. : 
. The four pairs selected to function as 
isolated pairs were chosen because they were 
the pairs most comparable to each other in 
difficulty level in both item-position versions 
of the list. The basis for the decision was 
data from a pilot study involving 32, Ss. 

The lists were printed on a printing press 
on 80-Ib. fotolith paper. The symbols were 
15 in. in height, printed in 36-point Clarendon 

ook type. The larger isolated items used on 
half the lists were 3$ in. in height, and ap- 
Peared in 36-point Twentieth Century Extra 

‘old Italic, Each list appeared on an endless 
tape in eight different orders, with no spaces 
Separating the orders. A Gloric memory 

rum with a specially constructed neon light 
assuring equal illumination of the stimulus 
and response apertures was USCC. the 
memory drum permitted two alternative 
Sa posure patterns, both based upon 2 4-47 
ec, time sequence. sys 

Design e procedure.—In addition to the 
Our variables specified above 1n the UM 
tion of the 32 lists, two other indepen e 
Variables were included in the design” (a) the 
Particular E who collected the data; (b) stim- 


ulus exposure time. For half the Ss, the lsrs 

Were presented in the usual paired-associa 
was exposed for a 

of which were 


te Ponse item. For the ee 

the stimulus shutter closed at 

Just as the response shutter opened ES RE 
€ response item for 2 sec- Under the 


Condition $ us and response items 
the stimul visible. Before 


ere never simultaneously 

€ experiment began, each of the 128 Ss = 
randomly assigned to a specific conditios um 
completely counterbalanced factorial En 
involving the six variables that have bee 
escribed, 


os ië 
Each S first practiced to 2 criterion of o 


perfect recitation on a list of six pairs of two- 
syllable nouns. He was then presented with 
his particular experimental list under the ap- 
propriate stimulus shutter condition. Learn- 
ing continued for a total of 24 trials. The Ss 
were instructed to spell out their responses 
and were told not to be afraid to guess and 
make mistakes. Responses were recorded 


verbatim. 

A minor variable that was controlled but 
not counterbalanced was the particular order 
of the list that S received as his initial order of 
presentation. Previous work involving a 
similar task (Erickson, 1963) demonstrated no 
effect of starting order on the size of the 
isolation effect. In the present study, starting 
order was confounded with specific isolated 
pairs so that each S received a starting order 
that placed his particular isolated pair in 
Position 4 in the initial presentation of the 


list. 
Subjeds.—The Ss were 128 volunteer 


undergraduate psychology students at Whit- 
tier College. There were 42 males and 86 
females. One substitute was run to replace 
an S from the response-isolation condition 
who was rejected for failing to meet a pre- 
determined criterion of 10 correct nonisolated 
responses of any kind during the 24 trials. 


RESULTS 


The dependent variables analyzed 
were: (a) mean number correct re- 
sponses Over Trials 1-24 to the three 
critical nonisolated pairs, each of 
which was used in some other list as 
the isolated pair; (b) difference be- 
tween number correct responses to 
isolated pair over Trials 1-24 and 
mean number correct responses to 
critical nonisolated pairs over Trials 
1-24; (c) mean number correct re- 
sponses over Trials 1-24 to the four 
noncritical nonisolated pairs that were 
not used in any other list as the 
isolated pair. The dependent variable 
described under b was the indicator of 
the isolation effect. The N (noniso- 
lated) scores, unless otherwise speci- 
fied, were based only on the rate of 
learning the three critical nonisolated 

airs described above under a. 

Table 2 shows mean correct re- 
sponses for isolated and critical non- 


ow 
N 
> 


TABLE 2 


MEAN CORRECT RESPONSES PER S PER 
PAIR OVER TRIALS 1-24 FOR ISOLATED 
AND CRITICAL NONISOLATED PAIRS 
UNDER CONDITIONS OF STIMULUS 
AND RESPONSE ISOLATION 


Isolation Cond. 
Response 
Category Stimulus Response 
Isolated Isolated 

solated E 
bis (1) 18.58 13.52 
Nonisolated ra 

Pairs (N) 9.45 8.67 
Difference 

(I — N) 9.13 4.85 


isolated pairs under the two conditions 
of isolation. It also shows the mean 
I — N score (isolated COrrect minus 
mean critical nonisolated correct), 
which represents the isolation effect. 
Figure 1 presents the learning curves 
for isolated and critical nonisolated 
pairs within each of the conditions of 
isolation. 

As indicated by Table 2 and Fig. 1, 
the isolated pairs were learned more 
rapidly than the critical nonisolated 
pairs under both conditions of isola- 
tion, but the advantage of isolated 
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over nonisolated pairs was greater 
under the stimulus-isolation condition. 
The mean difference of 4.28 in the 
respective I — N scores is significant, 
as shown by the F ratio for Stimulus 
vs. Response Isolation obtained in an 
analysis of variance of I — N scores, 
F (1, 104) = 12.53, p < .001. 

An independent analysis of variance 
was carried out on the I — N scores 
obtained under the response-isolation 
condition only. It yielded a signifi- 
cant Ffor the mean, F (1,63) = 28.19, 
p < .001, indicating that the mean 
difference between isolated correct and 
mean critical nonisolated correct, 
though significantly smaller than the 
corresponding difference obtained un- 
der stimulus isolation, was a signifi- 
cant difference. 

The main analysis of variance of the 
I—N scores evaluated all main 
effects and all two-way interactions 
involving the Stimulus vs. Response 
Isolation variable. In addition, two 
selected two-way interactions and 
two three-way interactions involving 
the S-R Isolation variable were eval- 
uated, the remaining interactions 
being pooled for error. Other than 
the highly significant F for the main 
effect of Stimulus vs, Response Isola- 


RESPONSE 
ISOLATED 


HL, 
9-—-9 Isolated pairs 


Nonisolated pairs 


SUCCESSIVE 4-TRIAL BLOCKS 


€) .60 STIMULUS 
c ISOLATED 
e .60 
Lu 
Z 40 
<1 
Lu 
= 20 

.00 

l 2 3 4 5 

lic. 1. 


Mean errors per S per pair per trial made to isolated and 
pairs averaged over four-trial blocks for each isolation ci 


critical nonisolated 
ondition, 
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TABLE 3 
ANALYSIS OF MEAN ERROR RATE PER S OVER "TRIALS 1-24 ror ISOLATED 
PAIR UNDER CONDITIONS OF STIMULUS AND RESPONSE ISOLATION 
Incorrect Responses to Critical Stimulus* 
z Incorrect 
ons Han. | e Elicitation 
nd. s 2 sites artial i of Critical 
Mean Total | No Intralist  OTTeC Stimulus sa reae 
Errore per S Response Intrusions Reece Items | Semonsg! 
Stimulus | | - | | | 
Isolated 5.42 60 | .09 .07 402: | 22 | 07 
| 
| | 
Response | | 
Isolated | 10.48 59 | AS | 06 01 A6 | HM 
Y t 


^ Error rate determined by d 
made by an S. pn 
b b Error rate determined by dividin 

Y total number incorrect responses ma: 
© Partial correct responses defined as responses 


tion, only one significant F was ob- 
tained, that for the interaction be- 
tween Stimulus vs. Response Isolation 
and Item Position, F (1, 104) = 442, 
P «.05. Inspection of cell means 
Showed that: (a) stimulus isolation 
Maintained an advantage over Te- 
sponse isolation in both item positions 
of the list; and (5) in the case of every 
critical item, a larger I — N score 
Was obtained when that item was 
isolated on the stimulus side than 
When it was isolated on the response 
Side. The significant interaction re- 
flects the fact that the advantage of 
Stimulus over response isolation was 
Sreater when the lists were presented 
in Item Position 1 than when the lists 
Were presented in ltem Position 2, 
The cause of this difference 18 un- 
known, . 

An analysis of variance identical to 
that described above was carried Em 
On the critical nonisolated scores. No 
Significant Fs were obtained. The 
effect of Stimulus vs. Response Isola- 
tion did not approach significance, 
F (1,104) = 0.67. One further check 
On possible differential rates of learn- 
ing nonisolated pairs under the two 
Main isolation conditions Was made. 

separate analysis of variance 0n the 


iding number of errors made in a particular category by total number of errors 


number incorrect elicitations of critical response by a particular stimulus 


de to that stimulus by an 5. 
with one correct letter. 


noncritical nonisolated scores (those 
scores based on the four nonisolated 
pairs in each list that did not appear 
in any other list as the isolated pair) 
was carried out. The result was non- 
significant, (1,126) = 3.49, p > .05. 

Table 3 presents an analysis of the 
errors made to isolated pairs. Intra- 
list intrusions (of response items from 
the seven nonisolated pairs) consti- 
tuted a small proportion of total errors 
at the critical position. However, it 
is of interest to note that, when cor- 
rected for opportunity, the rate of 
jntralist intrusions at the critical 
position and the rate of incorrect 
elicitations of the critical response (by 
the seven stimuli of nonisolated pairs) 
was twice as large under the response- 
isolation condition as under the stim- 
ulus-isolation condition. The rates of 
the other types of response error at 
the critical position were approxi- 
mately the same under the two 
isolation conditions. 


DISCUSSION 


The important findings of this investi- 
gation were: (a) isolation of either a 
stimulus item alone or a response item 
alone can produce the von Restorff effect 
in paired-associate learning; (b) isolation 
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of a stimulus item produces a greater 
isolation effect than isolation of a re- 
sponse item in paired-associate lists in 
which stimuli and responses represent 
comparable levels of intralist similarity, 
and; (c) this advantage of stimulus over 
response isolation is not contingent upon 
the longer exposure time for stimulus 
items characteristic of the usual paired- 
associate procedure. . : 
The present results are not inconsis- 
tent with any of the major theoretical 
explanations for the von Restorff effect 
discussed by Erickson (1963). However, 
special attention will be given here to 
some of the implications of the current 
findings for the S-R intralist interference 
explanation for isolation phenomena. 
Such an explanation would attribute the 
isolation effect found here to a reduction 
in intralist interference, resulting from 
either reduced stimulus or reduced re- 
sponse generalization of the isolated 
item. If it is assumed that the interfer- 
ence effects resulting from stimulus gen- 
eralization are greater than those result- 
ing from response generalization, it 
follows that isolation of a stimulus item 
will produce a greater reduction in intra- 
list interference (and hence a greater 
isolation effect) than a comparable degree 
of isolation of a response item. While 
the rate of intralist intrusions was small, 
the intrusion error data shown in Table 
3 are consistent with such an explanation. 
The smaller isolation effect obtained 
under response isolation was accom- 
panied by a rate of intralist intrusion 
errors that was twice as large as that 
obtained under stimulus isolation, 
Support for the contention that the 
interference effects resulting from stim- 
ulus generalization are greater than those 
resulting from response generalization is 
found in the data reported by Under- 
wood (1953a, 1953b) and Underwood, 
Runquist, and Schulz (1959). However, 
the latter investigators interpreted these 
same data somewhat differently. They 
maintained that the increased intralist 
interferences produced by equivalent in- 
creases in stimulus and response simi- 
larity can be assumed to have equal 
effects on the associative (as opposed to 
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the response-learning) phase of paired- 
associate learning. According to their 
interpretation, the empirical finding that 
increased stimulus similarity produces 
greater learning difficulty than a com- 
parable increase in response similarity 
would be explained by the fact that an 
increase in response similarity makes the 
response-learning phase easier by circum- 
scribing what has to be reproduced, an 
effect not duplicated by increases in 
stimulus similarity. y 

Underwood, Runquist, and Schulz s 
interpretation was based upon studies of 
the differential effects of increased stim- 
ulus and response similarity along dimen- 
sions of either synonymity or formal 
letter structure. From within this con- 
text, it would be predicted that making 
a response item more disparate in a list 
would increase the diffculty of the 
response-learning phase for that item. 
This would not seem a reasonable pre- 
diction when the disparity of the response 
item is along the dimension of color, 25 
it was here. There would seem to be no 
reason to assume that the printing of ‘a 
response in red should increase the diffi- 
culty of the response-learning phase for 
that item; in fact, the most likely pre- 
diction would be just the opposite. 
this is so, the results of the present 
investigation support the contention that 
the interference effects of a given leve 
of stimulus similarity are greater on the 
associative phase of learning than are the 
interference effects of a comparable leve 
of response similarity, since the same 
degree of differentiation produced P 
greater isolation effect when stimuli were 
involved. 

The foregoing interpretation assumes 
an S-R intralist interference explanation 
based solely upon reduced stimulus O" 
response generalization to be adequate 
to account for the differential isolatio™ 
effect found here. Unless the effect was 
Produced solely by a reduction in stim- 
ulus or response generalization tend- 
encies, the present results are not 
necessarily relevant to Underwood, Run- 
quist, and Schulz's assumption concern- 
Ing intralist interference effects. How- 
ever, if it is held that this type of S-R 
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explanation is adequate to account for 
the differential isolation effect found here, 
it cannot at the same time be maintained 
that the interferences produced by equal 
levels of stimulus and response similarity 
have equal effects on the associative 
phase of paired-associate learning. It is 
appropriate to point out that while the 
present data are not inconsistent with 
such an S-R explanation, previous studies 
(Erickson, 1963; Newman & Saltz, 1958; 
Saltz & Newman, 1959) have indicated 
that an S-R intralist interference ex- 
planation based solely upon reduced 
Stimulus or response generalization is not 
adequate to account completely for the 
von Restorff effect. 

À noteworthy subsidiary finding of the 
present study was that the alteration in 
Stimulus exposure time used here pro- 
duced no detectable effect on the case of 
learning either the isolated or the non- 
isolated pairs. Studies comparing the 
relative efficacy of paired-associate and 
Serial learning (e.g Primoff, 1938; 
Young, 1959) have typically ignored a 
Substantial procedural difference between 
the two tasks. In the usual paired- 
associate task, stimuli are exposed twice 
as long as responses, which are never 
exposed without their appropriate stimuli 
also being visible. Neither of these con- 
ditions obtain in the typical serial-learn- 
ing task, The variation in paired-associ- 
ate procedure used here, in which the 
Stimulus shutter closed just 25 the 
response shutter opened, more closely 
approximates the typical serial-learning 
task. The finding that this alteration 1n 
Procedure produced no detectable effect 
in paired-associate learning is consistent 
with the contention that the described 
difference between serial and paired- 
associate learning tasks is not an it 
Portant one, However, à more THORQNE 
Study of the parameters of this geri 

ural difference, using both serial an 
Paired-associate tasks, iS called for. 


REFERENCES 


Erickson, R. L. Relational isolation as a 
means of producing the von Restorff effect 
in paired-associate learning. J. exp. Psy- 
chol., 1963, 66, 111-119. 

KiMBLE, G. A., & Durort, R. H. Meaning- 
fulness and isolation as factors in verbal 
learning. J. exp. Psychol., 1955, 50, 361- 
368. 

NACHMIAS, J., GLEITMAN, H., & MCKENNA, 
V. The effect of isolation of stimuli and 
responses in paired associates. Amer. J. 
Psychol., 1961, 74, 452-456. 

Newman, S. E, & Sartz, E. Isolation 
effects; Stimulus and response general- 
ization as explanatory concepts. J. exp. 
Psychol., 1958, 55, 467-472. 

PiLLSBURY, W. B., & Raus, H. L. An ex- 
tension of the Koehler-Restorff inhibition 
phenomenon. Amer. J. Psychol., 1943, 56, 
293-298. 

Primorr, E. Backward and forward associa- 
tion as an organizing act in serial and in 
paired associate learning. J. Psychol., 
1938, 5, 375-395. 

Sartz, E, & NEWMAN, S. E. The von 
Restorff isolation effect: Test of the intralist 
association assumption. J. exp. Psychol., 
1959, 58, 445-451. 

Unperwoop, B. J. Studies of distributed 
practice: VIII. Learning and retention of 

ired nonsense syllables as a fuaction of 
intralist similarity. J. exp. Psychol., 1953, 
45,133-142. (a) 

Unperwoop, B. J. Studies of distributed 
practice: IX. Learning and retention of 
paired adjectives as a function of intralist 
similarity. J. exp. Psychol, 1953, 45, 
143-149. (b) 

UxpERwOOD, B. J., RuNQuisr, W. N., & 
Scuutz, R. W. Response learning in 
paired-associate lists as a function of intra- 
list similarity. J. exp. Psychol., 1959, 58, 
70-78. 

WiNER, B. J. Statistical principles in experi- 
mental design. New York: McGraw- 
Hill, 1962. 

Youxc, R. K. A comparison of two methods 
of learning serial associations. Amer. J. 
Psychol., 1959, 72, 554-559. 


(Received February 7, 1964) 


al of Experimental Psychology 
1969. Val Go Nee 3, 324-325 


SUPPLEMENTARY REPORTS 


FURTHER EFFECTS OF SUBJECT-GENERATED RECODING CUES 
ON SHORT-TERM MEMORY? 


RICHARD H. LINDLEY 


Eastern Michigan University 


48 men and women college students were teste 


(STM) study using the Peterson an 


Ss were tested with 4 retention intervals, high- 


(HM or LM) items, and 2 
preceding or following the trigram. 
the recoding cue; 24 recalled only th 
recoding cues in either order fa 
the trigram and the recoding cue h 
pared to recalling the trigram alone; 
cues was 93%, for HM cues 92%. 
This study follows u 
and Lindley (1964). 
(STM) for trigram- 
explored as a funct 


p a study by Schaub 
Short-term memory 
recoding cue pairs was 
ion of the two possible 
orders of item presentation (e.g., HONEY-HON 
and HON-HONEY) and of two different recall 
tasks (recall trigram or recall trigram and 
recoding cue). 

Method.—The 4X2X3X2 factorial 
design had four retention intervals (0, 8, 14, 
20 sec.), either high-meaningfulness (HM) 
or low-meaningfulness (LM) trigrams, three 
types of item presentations (two presenta- 
tions of trigram, recoding cue following 
trigram, recoding cue preceding trigram), and 
two types of retention task (recall trigram 
and recall trigram plus recoding cue). All Ss 
were tested with the same retention interv. 
levels of meaningfulness, 
presentation; half the Ss 
recall the trigram alone and the other half 
were instructed to recall both the trigram 
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als, 
and types of item 
were instructed to 


grant to the first 
Foundation. 


AND 
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Trinity University 
d in a short-term memory 


on (1959) technique. All 
or low-meaningfulness 


d Peters 


presentations of the trigram or recoding cue 


24 Ss recalled both the trigram and 
etrigram. Major results were (a) 


cilitated trigram recall; (b) recalling both 
ad no effect on trigram recall as com- 


(c) correct recall for LM recoding 


and the recoding cue in the order that they 
thought of them. sail 
One random order of the 48 items was used- 
The Ss had 12 sec. to recall the items anha 
6-sec. rest was given after the recall period. 
The items were the 24 LM and 24 S 
trigrams of Schaub and Lindley (1964) wit 
the corresponding common recoding Sai 
Forty-eight men and women paid volunteer 
from Trinity University served as 5s; they 
were assigned to the two conditions of Hen 
task alternately. In all other respects 64) 
procedure used by Schaub and Lindley (196 
was followed. " 
Results.—Table 1 presents the proportio 4 
of trigrams recalled correctly. The analys! 
of variance of these data (excluding the 0-sec 
retention interval) confirmed the Schaub i a 
Lindley (1964) study in essential dem 
meaningfulness, retention intervals, and i 
presence of recoding cues were all significan 
sources of variance, F (1, 782) (2, 782) (1, 78 
= 143.82, 6.63, 39.55, p < .01, respective m 
The order of trigram-recoding cue present 


ns 


TABLE 1 


PROPORTIONS oF TRIGRAMS 


RECALLED CORRECTLY 


Recall Trigrams Only 


Recall Trigrams and Recoding Cues 


Retention Intervals in Sec, 


Retention Intervals in Sec. 


DM HM. LM HM 
0 
Cond. Fe Bo SPE eu S E ae ww os ae Nae | a [a ga E 
Item Presentation 4 |1.00| .81 | .71 | 7 .67 
us i .98 | .69 | .58 | .54 | 1.00] . ‘oa | 11 | 100 | .67 | .48 | so | 1.00] .83 | .81 
Trgramstwie 556 1.65.62 | 265 | :98| 196 | 94 | “oh 98] ‘65 | “65 | ‘er | 100 | Sa | (37 5 
Cue Trigram 96 | -69 | -79 | .65 11.00] .83 | -87 | -83 | $96 | :67 | 3 | -63 1.00 | 96 | .92 | - 
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TABLE 2 
TRIGRAM AND RECODING CUE RECALL: PROPORTION oF TOTAL RESPONSES IN EACH CATEGORY 


LM HM 
Trigram Trigram Trigram Trigram 
Correct Incorrect Totals Correct Incorrect Totals 
Recoding cue correct 65 -28 93 90 02 .92 
Recoding cue incorrect .02 .05 07 .00 08 Tos 
Totals E 33 1.00 .90 10 1.00 


tion was not significant, F (1, 782) < 1; thus 
both of the trigram-recoding cue sequences 
facilitated recall. The sequence: trigram 
followed by recoding cue presumably mimics 
the normal sequence in which S searches for a 
recoding cue after the trigram has been 
presented. Inspection of Table 1 also shows 
that trigram recall is not affected by requiring 
the recall of the recoding cue along with the 
trigram, F (1,24) < 1; if Ss use the recoding 
cue to generate the trigram, then it should be 
immaterial whether or not the overt recall of 
the recoding cue is required. , 

The only other significant differences 
were Retention Intervals X Recoding Cues, 
F (1, 782) = 3.92, p < .05, and Meaningful- 
ness X Order of Item Presentation, F Q, 782) 
= 3.73, p <.05. The former interaction 1S 
significant because the rate of forgetting of 
trigrams with recoding cues present 1s slower 
than the rate when the trigram is presented 
twice; the latter interaction is significant be- 
cause the trigram-recoding cue sequence leads 
to superior recall with HM items and the 
recoding cue-trigram sequence is superior with 
LM items. j 

The 24 Ss who recalled both the trigram 
and the recoding cue aloud were instructed to 
recall them in the order that they thought o 
them; there were 118 out of a possible 57 
responses (excluding the 0-sec. retention 
interval) in which the order of recall w ast e 
reverse of the order of presentation. | 
uses the recoding cue to generate the rigent 
then the order of recall: recoding cue precedun. g 
trigram should be a more frequent type ol 
reversal than the order of recall: recoding me 
following trigram. This was the case; t ie 
mean number of the former type of ‘wor 
per S was 3.46, of the latter, 146, t ( 
= 4. B . 

Tee a ais the data for the four 
Possible combinations of correct or pa 
trigram recall and correct or incorrect reco: ing 
cue recall. The level of recall of the LM an 
IIM recoding cues is almost identical and 
most of the errors for the LM trigrams occur 
although the recoding cues have been correctly 
recalled. Presumably these two findings are 


due to the degree to which the first three 
letters of the recoding cue are in the same 
sequence as the letters of the trigrams. That 
is, for the typical LM combination (e.g., 
ZUJ-ZULU), if S recalls the recoding cue and 
spells the first three letters of it, he will make 
a mistake. On the other hand, for the typical 
HM combination (e.g. HON-HONEY), a 
similar strategy will lead to a correct recall. 
Thus the relatively poor recall of the LM 
trigrams appears to be a function of the ease 
with which the trigram can be generated from 
the recoding cue. The error data provide a 
further test of this hypothesis. 

“Decoding errors" were defined as three- 
letter errors in which the incorrect letter or 
letters of the error were present in the recoding 
cues; e.g., ZUL for ZuJ-ZULU. The three-letter 
errors from all 48 Ss were classified as either 
decoding errors or "other" errors; for the LM 
pairs there were 91 decoding errors and 79 
other errors; for the HM pairs the correspond- 
ing figures were 2 and 21, respectively. (For 
comparison, the number of errors classified as 
decoding errors from the conditions where the 
trigram was presented twice were 5 for LM 
trigrams and 3 for HM trigrams; the corre- 
sponding other errors in these conditions were 
69 and 31, respectively.) Thus the largest 
number of decoding errors are found in the 
condition where decoding the trigram from 
the recoding cue should be most difficult; 
ie, where the same order of letters is not 
present in both the trigram and the recoding 
cue. Finally, since the level of recall for the 
LM recoding cues is as high as for the HM 
cues, the difference in level of LM and HM 
trigram recall under the present conditions may 
be solely a function of the degree to which the 
rules for generating the trigram from the 
recoding cue are of an equal degree of 
complexity. 
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FIRST-LIST RETENTION AS A FUNCTION OF THE METHOD OF RECALL! 


JOHN P. HOUSTON; BERTRAM E. GARSKOF,; DALE E. NOYD, asp JANICE M. ERSKINE * 
University of Michigan 


CMT as " 
The present study compared amounts of retroactive inhibition (RI) 
SEd with the MMFR technique (Barnes & Underwood, 1959) and 
a single-list recall method. Following A-B, A-C learning Ss recalled 
either the responses from both lists or the responses from the 1st list. 


The results indicated that both recall methods resul 
RI but that the amounts of RI did not differ. 


In a recent study of first-list retention in 
the A-B, A-C and A-B, A-B’ paradigms, 
Postman (1962) found that the conventional 
anticipation method of measuring List 1 
recall resulted in measures of retroactive in- 
hibition (RI) which did not differ from those 
obtained with the MMFR technique devel- 
oped by Barnes and Underwood (1959). 
Since conventional anticipation of List 1 re- 
sponses involves competition from List 2 
responses, while the MMFR technique pre- 
sumably eliminates this factor, Postman 
interpreted the similarity between the 
amounts of RI as measured by the two meth- 
ods to mean that RI immediately following 
interpolated learning must be determined 
principally by List 1 associative strength and 
that response competition has little effect 
upon the magnitude of RI, The present 
experiment was designed to supplement this 
finding by comparing amounts of RI obtained 
with the MMFR technique and a single-list 
recall method. Following A-B, A-C learning, 
Ss recalled either the responses from both lists 
or the response from the first list. 

Method.—Three groups of 27 Ss learned 
two successive lists conforming to the A-B, 
A-C paradigm. A fourth group of 27 Ss 
learned only an A-B list. The A-B and A-C 
lists employed in the study were those used 
by Barnes and Underwood (1959). They 
were composed of eight nonsense syllables as 
stimuli and two sets of eight two-syllable 
adjectives as responses. The lists Possessed 
no apparent interlist or intralist similarity, 
The three pairings of the stimuli and responses 
used by Barnes and Underwood were also 
employed in the present study. Nine Ss in 
each condition were given each of the three 
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Pairings. An equal number of Ss in each 
group received each of the two lists as first 
and second lists. Three random orders of the 
pairs in cach list were used to minimize serial 
learning. The four conditions were ran- 
domized 27 times so that in 27 blocks of four 
‘Ss each condition occurred only once. The 
55 were assigned to these blocks in the ui: 
of their appearance in the laboratory. A 
lists were presented on a Stowe memory eum 
at a 2:2 sec. rate with a 4-sec. intertria 
interval. The Ss in three of the four condi- 
tions learned the A-B and A-C lists to à 
criterion of one perfect trial. A sd 
interval separated the two lists. A Contro 
group learned the A-B list to a criterion of one 
Perfect trial and then engaged in a maze- 
tracing task for 10 min. the approximate 
time taken by the remaining groups to learn 
the A-C list. Following A-C learning, the 
MMFR Group Ss were presented the stimuli 
on the drum and asked to recall both first- 
and second-list responses in the order they 
came to mind. The First-List Recall grouP 
was instructed to recall and say aloud only 
the List 1 responses. Second-List Recall group 
recalled only List 2 responses. Following the 
maze task the Control group Ss were asked to 
recall the A-B responses when presented with 
the stimuli. The three stimulus orders 
used during learning were presented equally 
frequently in all groups during the recall trial. 
Each stimulus was Presented for 12 sec. 
during the test trial. 
Results —The mean numbers of trials to 2 
criterion of One perfect trial on the A-B list 
Were 10.37, 10.78, 10.96, and 11.11 for the 
MMFR, First-List Recall, Second-List Re- 
call, and Control groups, respectively. The F 
for these values was not significant, F (3, 104) 
= .05, indicating that the four groups were 
comparable in terms of learning ability. 

The mean number of correct A-B responses 
recalled by the MMFR group was 4.44. The 
mean for the First-List Recall group was 4.48 
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and for the Control group 7.88. An analysis 
of variance applied to these means indicated 
a significant difference between them, 
F (2,78) = 49.54, MS error = 2.13, p < .001. 
The difference between the MMFR and the 
First-List Recall means was not significant, 
t (52) = .08. These results indicate that 
significant RI was demonstrated in both the 
MMER and the First-List Recall conditions 
but that the amount of RI as measured by 
these two techniques did not differ. 

Mean A-C recall for the MMFR group was 
7.74 while mean A-C recall for the Second- 
List Recall group was 7.70. The difference 
between these means was not significant, 
t (52) = 21. 

It is worth noting that the data of the 
present study indicate the reliability of the 
Barnes and Underwood (1959) findings. The 
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A-B and A-C recall means for both the 
MMFR and the First- and Second-List Recall 
groups closely parallel the means obtained 
by Barnes and Underwood at high levels of 
second-list learning. This agreement is par- 
ticularly interesting in the light of the fact 
that the present study involved fixed recall 
intervals rather than S-paced recall. The 
implication is that S-pacing procedures may 
be of little importance in relation to the un- 
learning effect. 
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A previously reported shift in 
It is suggested that change in 


to make judgments, and then to replace them 
between trials. To eliminate this unstable 
condition of adaptation, all Ss in the current 
study performed without goggles in a dark- 
ened room to which they had become adapted. 
Finally, to eliminate all possibility of head 
movement, a metal headrest extending 24 in. 
beyond the forehead was utilized. 

Method.—Twenty female undergraduate 
students were assigned at random to one of 
the four factorial conditions of right or left 
auditory stimulation and control-experi- 
mental order. Ten trials were administered 
in both experimental and control conditions, 
‘The control condition consisted of trials dur- 
ing which no auditory stimulation was 
presented. 

The Ss were required to adjust a luminous 
line segment by means of a knob until it ap- 
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peared to be vertical. The line segment, 1 
mm. wide and 6 cm. long, was 48 in. from S. 
Headrest apertures permitted Ss to look into 
the apparatus and perceive the line segment. 
The only source of illumination in the experi- 
mental room was a dim light in the back of 
the apparatus. This light illuminated a rod 
(connected to the line segment disk) which 
served to indicate angular displacement. This 
displacement was measured to the nearest 
0.25°. Auditory stimulation of .15 v. at 1,000 
cps was presented through two matched 
stereo headphones (8 ohms impedance). 

The method of adjustment was employed. 
Alternating ascending and descending series 
were utilized. The Ss were permitted to over- 
shoot and retrace until satisfied with their 
judgment of apparent vertical. Each S was 
given five practice trials to insure familiarity 
with the apparatus and to permit adaptation 
to conditions of ambient light (15-min. 
adaptation time). 

Results.—A control point of subjective 
verticality (PSV.) was computed for each S. 
For the experimental condition, an auditory 
stimulus point of subjective verticality (PSV,) 
was computed. A difference score, d, was 
computed for each .S (d = PSV, — PSV,). 
On the basis of the Werner et al. study, it was 
expected that this score would differ in sign 
for those Ss receiving left-ear stimulation as 
contrasted to those Ss receiving right-ear 
stimulation. Consequently, all d values for 
the left-ear stimulation condition were multi- 
plied by —1.0. Twenty d scores, representing 


mean differences between control and auditory 
stimulation, corrected for vector sign, were 
obtained. A simple £ test was computed for 
these data. The mean of the difference scores 
(d= .11°) was not significantly different 
from zero, ¢ (19) = .22. 

The possibility remained that procedural 
changes were responsible for increases in 
within-Ss variability. However, the control 
vs. stimulation correlation of .68 for the 
present study reflects a decrease, rather than 
an increase in within-Ss variability. Thus the 
sensitivity of the present study to small 
systematic differences is greater than for the 
Werner et al. study. 

Discussion.—The inability to obtain con- 
sistent shifts in visual perception of the 
vertical under conditions of unilateral audi- 
tory stimulation requires explanation. It is 
possible that the unifying factor underlying 
the Werner et al. studies is related to a shift 
in the physical attitude of the body. In the 
present study, the phenomenon is no longer 
observed when the head is fixed during 
judgments of verticality. Other procedura 
changes of the current study may also have 
contributed to the lack of a systematic shift. 
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1 visual 
Results 


with respect to visual discrimination, 
and the second and third experiments 
extended the investigation to tactual 
discrimination. 

Two general hypotheses about the 
nature of learning during improve- 
ment of discrimination can be identi- 
fied. One can be loosely termed a 
"schema" hypothesis, and is sug- 
gested in discussions of Bruner (1957a 
1957b), Vernon (1952, 1955), and in 
a recent book on perceptual develop- 
ment by Solley and Murphy (1960). 
Although these investigators deal 
primarily with identification behavior 
ie., recognition and categorizing be 
havior, their discussions implicate 
discrimination behavior as well. Ac- 
cording to this point of view, discrim- 
ination and identification involve 
matching sensory data or ‘“‘cues” 
about objects to prototypes or models 
of the objects which have been built 
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up through repeated experience with 
the objects and ‘“‘stored” in memory. 
Improvement in discrimination would 
involve first constructing schemata or 
models of the objects to be discrim- 
inated, and then matching the sensory 
data to the models so as to identify 
them as “same” or "different." Prac- 
tice with objects to be discriminated, 
then, would erable S to build up and 
refine the appropriate schemata. 

A second general hypothesis about 
improvement in discrimination, a 
"distinctive feature" hypothesis, is 
suggested by the Gibsons and their 
colleagues (Gibson & Gibson, 1955; 
Gibson et al., 1962). It utilizes the 
concept of distinctive features intro- 
duced by Jacobson and Halle (1956) 
in a discussion of phoneme character- 
istics. Distinctive features can be 
thought of as dimensions of difference 
which distinguish and provide con- 
trasts among objects. The hypothesis 
developed from the Gibsons' work is 
that improvement of discrimination 
consists of learning the distinctive 
features of the objects to be discrim- 
inated. The function of practice, 
according to this point of view, is to 
enable S to respond to an increasing 
number of stimulus variables and to 
discover which of these variables are 
"critical" in the sense that they serve 
to distinguish between one object and 
another. 

The physical conditions resulting in 
improvement of discrimination do not 
necessarily differ for these two hy- 
potheses. Both would predict that 
such improvement will occur as a 
function of practice with the objects 
to be discriminated. Hence a transfer 
design can be employed to determine 
the extent to which prototype learn- 
ing, distinctive feature learning, or 
both will occur during training. Spe- 
cifically, Ss could be presented 
with initially undifferentiated stim- 
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ulus forms and trained to discriminate 
among them. Ina transfer task they 
could then try to discriminate among 
stimulus forms which either differ from 
each other in the same dimensions as 
those which they learned to discrim- 
inate or which have the same proto- 
types as those which they learned to 
discriminate. Differential perform- 
ance in these two transfer conditions 
should shed light on the function of 
prototype learning and distinctive 
feature learning in improvement of 
discrimination. Since the two proc- 
esses are not mutually exclusive, 
relative transfer in the two conditions 
described can be determined by com- 
paring them to a control condition 1n 
which both the prototypes and dimen- 
sions of difference of the forms differ 
from those used in training. 


EXPERIMENT I 


Method 
Subjects 


The Ss were 60 kindergarteners. Each 5 
was randomly assigned to one of three equal 
transfer groups. There were an approxi- 
mately equal number of boys and girls among 
the 20 Ss in each group. 


Materials 


The stimulus forms were letter-like forms 
of the kind used in the developmental study 
of Gibson et al. (1962). The forms, approx! 
mately 1X1 in, were black and were 
mounted on rectangular white cardboard 
cards 3 X 4 in. " 

There were six standard forms and six 
different transformations of each of the stand- 
ard forms. These transformations were one 
change of a straight line to a curve, tW 
changes of straight lines to curves, a right-left 
reversal, a 45° rotation, a perspective trans- 
formation equivalent to a 45? backward tilt, 
and a 25% increase in size, The standar 
forms and six transformations for each are 
shown in Fig. 1. 


Procedure 


Training.—The training procedure was the 
same for all Ss. The S was seated in front of 
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A md table on which a stand similar to a 
Were nl was placed. Three standard forms 
contas aced on this stand and a pack of cards 
ree pe two copies of each standard and 
Spread ransformations of each standard was 
was (1 Qut in front of S on the table. The S 
nstructed to look carefully at each of the 
vat and decide whether it was exactly 
differe me as one of the standards or if it was 
Exact] nt. When S found one which was 
Was iis the same as one of the standards, he 
magi sttueted to give it to E. When S had 
(i.e, & Judgment about every card in the pack 
and’ trished one trial), E shuffled the cards 

"Cached. procedure was repeated until 
fave p a criterion of one perfect trial, - 
Confusion” the two copies of each standard. 
dise n errors (transformations phich 
ere ie Were exact copies of a standard) 
Which Bed: A correction procedure, in 
Tight 7 told S whether each judgment was 
Or wrong, was used on every trial 

rst. . 
Pract the first training trial, a pretraining 
Proceg, trial with real letters and correction 
with © Was carried out in order to acquain 
inde; the task and to ensure that they 
Sacre that only forms which E 

ou 

be give, Te amg as the standard s 
trial "emer -— Following the criterion training 
did the transfer procedure was carried out. 
me ee was the same as in training but only 
al was given and there was no correc: 
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tion given. Confusion errors were recorded 


as before. 

The Ss were divided into three transfer 
groups in a predetermined arbitrary order. 
These groups differed in terms of the par- 
ticular forms used in the transfer trial. Group 
C provided a base line with which to compare 
the transfer performance of the other two 
groups. This group received three standards 
and three transformations all of which were 
different from the ones used in training. For 
example, if in training these Ss had learned to 
distinguish copies of Standards A, B, and C 
from line to curve and size transformations of 
these standards, then their transfer task was 
to distinguish copies of Standards D, E, and 
F from rotation, reversal, and perspective 
transformations. 

Group EI reflected the extent to which 
standard or prototype learning had occurred 
during training. This group received the 
same standards as in training, but three new 
transformations of these standards. For 
example, if in training these Ss learned to 
distinguish Standards A, B, and C from line 
to curve and size transformations of these 
standards, then their transfer task was to 
distinguish these same standards from re- 
versal, rotation, and perspective trans- 
formations. 

Group EII reflected the extent to which 
distinctive feature learning had occurred dur- 
ing training. This group received three new 
standards, but the same three types of trans- 
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formations of these standards as those with 
which they dealt during training. For ex- 
ample, if in training these Ss learned to 
distinguish Standards A, B, and C from 
among reversal, rotation, and perspective 
transformations of these standards, then, in 
the transfer trial, their task was to distinguish 
Standards D, E, and F from reversal, rotation, 
and perspective transformations of these 
standards. 

In order to balance the design for possible 
differences in difficulty of discriminating 
specific combinations of standards and trans- 
formations, four subgroups of Ss were used in 
the training condition. Each was trained with 
a different combination of standards and 
transformations. One had Standards A, B, 
and C with line to curve and size transforma- 
tions. Another had Standards D, E, and F 
and these same transformations. A third sub- 
group was trained with Standards A, B, and 
C and reversal, rotation, and perspective 
transformations, and a fourth had Standards 
D, E, and F and these transformations. 

'There were also, of course, 4 subgroups 
within each of the 3 transfer groups since the 
combination of forms used in transfer for a 
given S depended on the combination of forms 
used in training. Thus there were 12 transfer 
subgroups with five Ss in each. 


Results 
Training 


Differences between the means of 
the groups in number of confusion 
errors made on the first trial and in 
number of trials to criterion were 
analyzed using ¢ tests. No differences 
approached the .05 level of significance 
and hence the null hypothesis that 
these groups were from the same 
population could be accepted. 


Transfer 


Confusion errors in the transfer 
trial constituted the main data of the 
experiment. Table 1 shows the errors 
made in the transfer trial by each 
of the 12 subgroups and for the 
three transfer groups with subgroups 

ined. 
"i analysis of variance was eer 
formed on the data for the 12 sub- 
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TABLE 1 


CONFUSION ERRORS IN TRANSFER TRIAL 
FOR THE 12 SuBGROUPS 


Transformations 
T 123 | T 456 
Group | Total 
Standards 
SABC | SDEF | SABC | SDEF 
EL | i | 2b | i$ 2 
EH | ii 6 | 10 | 12 | 39 
c 23 | 21 | 32 | 25 | 101 


n " nes icular 
Note.—Column headings indicate the particular 

combination of standards and transformations use! 

the transfer trial. 2 = 5 in each subgroup. 


groups. Only the effect of transfer 
groups was significant, F Q, 48) 
— 12.69, p « .001. Thus the differ- 
ences among the three experimental 
groups obtain regardless of subgroups: 
i.e., regardless of the particular com- 
bination of standards and trans 
formations used. 

The differences between the three 
groups in transfer errors were analyze 
with ¢ tests. The three groups Were 
significantly different from each other 
with a probability level of less than 
.01 with a two-tailed test. 


Discussion 


The results of this experiment suggest 
that learning distinctive features may 5 
a significant component of improvement 
in visual discrimination of letter-like 
forms. The Ss who in the transfer tria 
dealt with forms which they had never 
seen before but which varied from eac 
other in familiar ways (the EII group 
made the fewest confusion errors. This 
suggests that during the training trials 
Ss were learning how the forms varied 
from each other as they improved i? 
their ability to discriminate among them. 

The fact that Group EI, the grouP 
having familiar standards and new trans- 
formations, was superior to the control 
group suggests that prototype learning 
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also occurred during training. However, 
the clear superiority of Group El implies 
that such learning may not be essential 
to improvement in discriminations of 
this sort. 


EXPERIMENT II 


The purpose of this second experi- 
ment was to investigate the generality 
of the results of the previous experi- 
ment for improvement in tactual 
discrimination, Adaptations of the 
procedure and materials of the previ- 
ous experiment were made in order to 
Provide appropriate conditions for 
Studying improvement in tactual dis- 
crimination. These adaptations are 
noted below. The basic method and 

esign of this experiment were the 
Same as in the previous one. 


Bun Method 

ubjects 

"m. Ss were 72 first graders. First-grade 
children were used as Ss because pilot work 


indicated the task was better suited to this 


ane group than to the kindergarteners used 
Previously, 


Materials 
, The stimulus forms were metal reproduc- 


i > S 
Ons of some of the letter-like forms used in 
© previous experiment. The forms (x1 
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in. were made by an engraving process and 
were raised lines on a smooth square metal 
background about 14 X 11 in. 

There were four standard forms and 10 
transformations of each: one, two, and three 
changes of lines to curves, a 25% increase in 
size, two topological transformations: break 
and close, a right-left reversal, 45° and 90° 
rotations, and a perspective transformation 
equivalent to a 45° backward tilt. The 
standard forms and their transformations are 
shown in Fig. 2. 


Procedure 


Training.—The S was seated at a small 
table on which a form board was placed. This 
plywood board 12 X 15 in. with a raised 
block in the middle, was used to display the 
forms in front of S. The raised block was 
covered with “velero” as were the backs of 
the metal forms and the forms could thus be 
made to adhere to the block for presentation 
to S. The S was blindfolded and a standard 
form and one of its transformations were 
placed on the board. The standard was 
always presented on the left and the trans- 
formation on the right. 

The S's dominant hand was placed on the 
board and he was instructed to feel first one 
form and then the other with that same hand 
and decide whether the two forms were the 
same or different. When S had made a 
judgment about the pair of forms, the trans- 
formation was removed and replaced by 
another. After a few presentations, E no 
longer had to guide S’s hand to the board. 
A trial consisted of seven presentations for 
comparison: one standard form to be com- 
pared with each of five transformations and 
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two copies of the standard in random order. 
When S completed a trial, the procedure was 
repeated in a different random order until S 
reached the criterion of learning This 
criterion consisted of either a perfect trial or 
a single confusion error for either the size 
or perspective transformation (whichever the 
particular S was given). This weaker second 
condition was accepted as a criterion condi- 
tion because size and perspective transforma- 
tions proved in pilot work to be impossible 
for many Ss to distinguish from standards 
long after they were able to distinguish the 
other transformations from standards. 

Preliminary practice prior to training 
consisted of practice with visual letters fol- 
lowed by both visual and blindfolded practice 
with letters in the form of tactual stimuli. 

Transfer.—The same three transfer condi- 
tions as before were used. Group C received 
a different standard and five different trans- 
formations of the standard from the ones used 
in training. Group EI received the same 
standard as in training, but five new trans- 
formations of this standard. Group EII re- 
ceived a new standard, but the same five 
transformations of this standard as those 
with which they had dealt in training. 

Eight subgroups were used in the training 


*'The training task proved too easy for 
some first graders. Since this was a learning 
experiment, Ss who performed the training 
task successfully on the initial trial were 
discarded from the experiment. There were 
also a few Ss who were never able to learn the 
training task and who were also discarded. 
In all, about 100 Ss were run through the 
training procedure. The final N represents 
those Ss who did not perform at criterion 
level on the first trial and who were eventually 
able to do so after several training trials. 
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condition representing training with each of 
the four standards and the two groups of 
transformations. One transformation group 
consisted of line to curve transformations, the 
size transformation, and the close. The other 
group consisted of the reversals, rotations, 
perspective transformation, and the break. 


Results 
Training 


Differences between the groups 1n 
number of confusion errors made on 
the first trial and in number of trials 
to criterion were analyzed with / tests. 
None of the differences approached 
significance. 


Transfer 


Confusion errors made by the sub- 
groups and for the three groups wit 
subgroups combined are shown n 
Table 2. The analysis of variance 1T 
dicated a significant effect of transfer 
groups at less than the .001 proba- 
bility level, F (2,48) = 10.63. The 
Transformations X Standards inter- 
action and the triple interaction were 
also significant at less than the .05 
level of probability. A comparison 9 
the total errors made by the three 
groups indicated that Groups EI an” 
EII were not different from eac? 
other but both were superior in Pet 
formance to Group C. 


TABLE 2 
CONFUSION Errors IN TRANSFER TRIAL FOR THE 24 SuBGROUPS 
Transformations 
T 12345 T 6789 & 10 
Group Total 
Standards 
SA SB SD SE SA SB sD SES | 
2 24 
I 3 8 2 2 E 1 1 3 
Et 3 2 3 3 3 3 5 3 25 
E 9 9 9 2 2 5 5 7 25 


Note.— Column headings indicate combination of forms used in the transfer trial. n = 3 in each subgrouP* 
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_ The ‘Transformations X Standards 
interaction effect seemed to be due to 
a difference between errors made to 
Standard E and errors made to the 
other standards (cf. Fig. 2). For 
Standards A, B, and D more errors 
Occurred with Transformations 1-5 
than with ‘Transformations 6-10. 
This pattern is reversed with Standard 
E which probably accounted for the 
fact that the effect of transformations 
id Not reach significance. 
c “~ three-way interaction is diffi- 
wt to interpret meaningfully. Prob- 
oly its effect was accounted for by 
Sener that there were only three Ss 
cell cell, and one atypical S in a given 
t could account for the scores 1n 
at cell deviating from the overall 
Pattern, = 


Discussion 


Peet the results of this experiment 
experi ifferent from those of the previous 
Brou iment. In that experiment, the 
transf Which had opportunity to use Im 
istine what it had learned about the 
Superis features of the forms was 
transi: to both the other groups in 
nt eg, trial performance. In the pres- 
3 wb rifuent, the comparable group of 
orman, no better in transfer trial per- 
Portun than the group which had op- 
Mode] ity to use prototypes or memory 
Were S of the forms, though both groups 
ne Etter than the control group- — 
that «quterpretation of these results 15 
Visual © process of improvement 5 
Proceg, iscrimination is different from tne 
Crim; Of improvement in tactual dis- 
tig, "ation and that schema construc- 
nd detection of distinctive features 
Ratna ae equally useful functions in 
nou IScrimination. 
dig, ether interpretation is related to 
Be Nees in the task required ol 
Plor Se S in the tactual experiment ex- 
Made the forms with only one hand, he 
V0 Successive comparisons- This task 


him memory and perhaps required 
Stan orm a memory model of the 
ard even if his task in learning WaS 
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to detect distinctive features. In the 
visual experiment, however, S could look 
back and forth between the comparison 
forms without having to remember how 
they looked in order to discover the 
differences between them. 

If this second interpretation is correct, 
making the tactual comparison simul- 
taneous should then result in the reap- 
pearance of superior performance by 
Group EII. The third experiment was 
conducted to test this hypothesis. 


EXPERIMENT III 


The major difference between this 
experiment and the previous one is in 
the nature of S’s task. There were 
minor differences in number of Ss, 
material, and subgroups as noted 


below. 
Method 


Subjects —The Ss were 60 first graders. 

Materials —The forms were Standards A 
and D and the 10 transformations of each. 

Procedure.—Yhe procedure was the same 
as in the previous experiment except that 
throughout this experiment, S explored the 
two comparison forms simultaneously, one 
with each hand.? 

There were four subgroups representing 
training with each of the two standards and 
the two groups of five transformations each. 


Results 
Training 
None of the differences between the 
groups in either number of trials or 
errors in the first trial approached 
significance. Hence the groups could 


3 Some Ss could not complete the training 
task successfully and were discarded. Their 
inability to perform successfully appeared to 
be a function of an inability to coordinate 
both hands in exploring the forms. There 
were also a few Ss who performed the task at 
ion level on the first trial, and were 

About 80 Ss began the training 
and the final N, as in the previous 
t, represents those Ss who did not 
criterion level on the first training 
ho eventually learned the task. 


criteri 
discarded. 
procedure 
experimen 
perform at 
trial and w 
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TABLE 3 


CONFUSION ERRORS IN TRANSFER TRIAL 
FOR THE 12 SUBGROUPS 


Transformations 
T 12345 T 6789 & 10 
Group Total 
Standards 

SA sD | SA SD 
EI 10 11 6 4 31 
EH 3 3 3 0 9 
€ 10 10 5 7 32 


Note.—Column_ headings indicate combination of 
forms used in transfer trial. » = 5 in each subgroup. 


be considered equivalent with respect 
to their ability to handle the forms. 


Transfer 


Transfer trial errors for each sub- 
group and for subgroups combined are 
shown in Table 3. The analysis of 
variance performed on the data 
showed the effect of transfer groups to 
be significant with a probability of 
less than .001 that the effect was due 
to chance, F (2,48) = 8.62. The 
effect of transformations was also 
significant with a probability of less 
than .01 that the effect was due to 
chance. Neither the effect of stand- 
ards nor any interaction was sig- 
nificant. 

A comparison of the total errors 
made by the three groups showed that 
Group EII, the group having new 
forms but familiar transformations in 
the transfer task, was superior to the 
other two groups in performance on 
this task. Group EI, the group hav- 
ing familiar forms but new trans- 
formations in the transfer task was 
not different from the control group. 

The effect of transformations is due 
to the fact that more errors occurred 
with Transformations 1—5 than with 
Transformations 6-10. Except for 
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one standard (E) this was also true in 
Exp. II. Apparently line to curve 
transformations are, in general, more 
difficult to discriminate tactually than 
rotations and reversals. 


Discussion 


These results support the hypothesis 
that under conditions of simultaneous 
comparison, the group having oppor- 
tunity to use in transfer what they had 
learned about the distinctive features of 
the forms would show superior perform- 
ance. Not only was this group (El) 
superior to the other two groups, but the 
other experimental group showed no 
better performance than the control 
group. In this experiment, the construc- 
tion of schemata, if such a process oc- 
curred at all, showed no effect in the 
transfer task. 


GENERAL DISCUSSION 


A consideration of the tasks involved 
in these experiments may make the three 
different patterns of results meaningful. 
The third experiment involved a task o 
simultaneous comparison. The results 
suggested that Ss had, in training 
learned the distinctive features of the 
forms since the superior group had no 
opportunity to construct schemata of the 
forms used in the transfer task. Those 
Ss who could use schemata in the transfer 
task performed no better than the con- 
trol group. k 

The second experiment involved a tas% 
of successive comparison. These Ss aP- 
parently both constructed schemata © 
the forms azd learned distinctive wd 
since groups who could use either I$ 
tinctive features or prototypes showe 
similar amounts of transfer relative tO 
the control group. k 

The first experiment involved a am 
which might be considered to lie betwee 
the tasks of the two tactual experiments 
in terms of the nature of the comparison: 
None of the Ss in this experiment ha 
to explore one form thoroughly before 
exploring the comparison form as did Ss 
in the second experiment. On the other 
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hand, Ss in this first experiment probably 
did not receive information from both 
the standard and comparison forms 
simultaneously as did Ss in the third ex- 
periment. Most likely, Ss in this first 
experiment quickly looked back and 
forth several times between the standard 
and comparison form in order to make a 
Judgment about them. The Ss who 
Could use in transfer what they had 
learned about distinctive features showed 
the best transfer task performance. 
hose Ss who could use schemata also 
Showed transfer but significantly less 
than the other experimental group. 
eo terms of the tasks involved in these 
i ia one | might interpret the 
of disti as suggesting that the detection 
iie s inctive features will always facili- 
that AS Mains cu] in discrimination but 
Dariso nder conditions of successive com- 
eL. schema construction will i- 
Pendently facilitate such improvement. 
that d: parsimonious interpretation is 
may M detection of distinctive features 
Chios the basis for improvement in dis- 
iue When such detection is 
Nature ent on memory because of the 
of the task (e.g., in Exp. II and to 
een degree in Exp. I), some schema 
requires does occur. When no memory 
Exp, In is imposed by the task (e.g., 
Occur D» schema learning does not 
feature n short, detection of distinctive 
Cient B may be the necessary and suffi- 
iscrin Condition for improvement in 
"nation; Schema learning may or 
mener occur depending on the experi- 
its fii ens. When it does occur, 
arison 1on is to make possible the com- 
9 mak and search for differences, 1€ 
tive fe * possible the detection of distinc- 
atures, 


T : 
he data of the present experiments 
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are consistent with this interpretation. 
In no case did an EI group perform 
better than an EII group. Furthermore, 
the EI groups showed better performance 
than the control group only to the extent 
that memory was involved in the task 
of comparison. 

Further research is necessary to estab- 
lish the validity of this interpretation. 
A direct test might be made by deter- 
mining whether, under conditions of 
successive comparison, Ss in fact have 
learned a prototype or memory model of 
the standard forms. Can they identify 
the given standards from a group of un- 
related forms, or can they reproduce the 
standards better than Ss who have 
operated under conditions of simultane- 
ous comparison? 
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CUE AND SECONDARY REINFORCEMENT EFFECTS 


WITH CHILDREN ! 


JOSEPH B. SIDOWSKI, NORMAN KASS, anp HELEN WILSON 
San Diego State College 


Groups of children served in a lever-pulling experiment in order to 
determine the secondary reinforcement, cue, and combined secondary 
reinforcement/cue effects of a light stimulus. On each conditioning 
trial the light was differentially associated with the presentation of a 
primary reinforcement. After conditioning, Ss underwent a period of 
extinction during which time a lever pull resulted in a light or a no-light 
condition. An evaluation of the number of responses during extinction 
showed a significant difference between the secondary reinforcing and 
the cue effects of the light stimulus. The discussion covered a need for 
adequate experimental controls in studies attempting to show a “real” 


secondary reinforcement effect. 


The purpose of the present study 
was to investigate the cue and 
secondary reinforcement (S*) effects 
of a stimulus in an experiment utiliz- 
ing children as Ss. Cue properties of 
a stimulus were indicated when there 
was a tendency for a response to occur 
following the presentation of a stim- 
ulus. The stimulus showed rein- 
forcing properties when there was an 
increase in the strength of some re- 
sponse that preceded the onset of the 
stimulus. 

Although the cue and S effects of a 
stimulus have been discussed for a 
number of years, relatively few studies 
have been directed toward experi- 
mentally differentiating these effects 
(Bugelski, 1956; Elam, Tyler, & 
Bitterman, 1954; Melching, 1954; 
Wyckoff, Sidowski, & Chambliss, 
1958). The lack of research in this 
area has been especially prevalent in 

1 This study was supported by funds from 
National Institute of Mental Health Grant 
MH 06806-01. The authors wish to express 
their appreciation for the cooperation of 
Burton Tiffany, Superintendent of Schools, 
James Spencer, Principal, and the teachers of 
Lauderbach Elementary School at Chula 
Vista; William Link, Principal, and the 
teachers of Kellog Elementary School; and 
Robert Nardelli, San Diego State College 
Laboratory School. 


human learning where the emphasis 
has been upon substantiating the 
existence of St and upon defining some 
of the variables influencing its strength 
(Hubbard, 1951; Myers, 1958; Myers 
& Myers, 1963). À 

Bugelski (1956) criticized conven” 
tional animal Skinner box tests of 
on the grounds that the assume 
secondary reinforcing signal may be 
evoking or facilitating the next lever 
response, rather than acting aS ? 
reinforcement for the preceding one: 
Wyckoff, Sidowski, and Chambliss 
(1958) found no indication of seconc^ 
ary reinforcing effects in either of tW? 
rat studies evaluating the relationship 
between the St and cue effects of ? 
stimulus. The results of the ees 
study were discussed with regard t ; 
the failure of other Es to control yd 
cue error and suggest implied S* effec 
to be due to the cuing of the stimulus. 
Recent reviews discuss the concept M 
detail and agree on a need for som 
clarification of the process (Myer®: 
1958; Wyckoff, 1959). D 

The present study was designed To 
differentiate the potential cue an 
effects of a light stimulus during the 
conditioning of a lever-pulling 1 
sponse and to measure these effect 
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during extinction. If S* effects exist 
and/or are unconfounded by cue 
effects, we would expect greater 
resistance to extinction for St in a 
design which allows for a differentia- 
tion of the two phenomena. 


METHOD 


Subjects —The Ss were 140 kindergarten 
children between the ages of 5.3 and 6.3 yr. 
Equal numbers of males and females were 
randomly assigned to each experimental 
condition, 

, Abparatus.—The S's panel consisted of a 
Simulated slot machine. A handle, mounted 
9n the right side, could be pulled down to 
"ispense pennies into a cup at the bottom 
of the front panel. A plastic hemisphere 
dome) protruded from the center of the 
qichine. Two 74-w. red lamps were mounted 
“rectly behind a frosted upper portion of the 
ami " The lower half of the dome was clear 
thr displayed a large number of pennies 
ol Tughout theexperiment. The presentation 
rec ight stimuli, dispensing of pennies, and 
dier ng of responses were controlled by 
tiene Jae programing and recording equip- 
ex located in a room to the rear of the 
E Perimenta] room. : 
"is large pegboards displayed approxi- 
consie, 50 prizes to each The prizes 
and Sted of toy watches, rings, parasols, fans, 

Other toy items of interest to children. , 
rocedure-—The E took each child from his 


Clas A 
a assroom to an experimental room located in 
t quiet section of the building. The $ was 


machin he was going to play the ‘penay 
app, ine game” and stood in front of the 
vi nier The E pointed to the pile a 
le es located in the dome and told S E 
Could" to try to obtain as many as he posstbly 
ispla he S was then shown the eatr 
be 12% Of prizes and told that the toys cou 
requir sit with the pennies. Each S ys 
like pa to indicate the prize that he wou e 

esei obtain, following which he was s!v° 
instructions: 


y 
Now I will show you how to play the RU 
cont” You pull the handle, the penni? 
ope Out here. You can play the game E 
to S&S You like, Tell me when You Y 
you Ps Remember, the more p bx 
changet out of the machine, the, s A 
opace you have of obtaining the prize tha 
u want, 
r experi- 
rcement 
and 


an 
mina ‘Ss were assigned to one of fou 
e 8 Conditions: primary reinfo 
> Secondary reinforcement (S°), Cue, 
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combined St/Cue effects. The P" group 
received a penny each time the lever was 
pulled to the “down” position. The S" group 
had a red light associated with the presenta- 
tion of the penny during conditioning. The 
light was off until S pulled the handle to the 
down position; it automatically illuminated 
.5 sec. before the penny was dispensed and 
remained lit until the penny appeared in the 
cup, 1 sec. In the Cue group, the red light 
was presented at the beginning of each condi- 
tioning trial, i.e., while the handle was still in 
an "up" position. The light remained illu- 
minated until S pulled the handle down, and 
was turned off before the penny appeared in 
the well. In the S*/Cue group, the red light 
was presented while the handle was in the 
“up” position and remained illuminated until 
the lever was pulled down and the penny 
appeared in the well. It then went “off” until 
the next trial. Thus, the effects of reinforce- 
ment and cue were confounded. 

Immediately after the 10 conditioning 
trials, Ss in the S*, Cue, and S'/Cue conditions 
were subjected to one of two extinction 
conditions, Light or No Light. The red light 
was illuminated following the responses of the 
Light-only extinction groups; responses made 
by the No-Light groups were followed by 
neither pennies nor light. The Ss in the Pr 
group served as an additional control and 
continued to receive pennies for lever pulling 
during the extinction interval. The design, 
therefore, had seven subgroups. 

Twenty Ss (10 male and 10 female) served 
in each of the seven extinction conditions. 
The temporal relationship of the light to the 
response was the same during extinction as it 
was during conditioning, e.g., the light came 
on for the St group only when the lever was in 
the “down” position. Each S in all groups 
played until he indicated a desire to stop, or 
until he completed 100 extinction trials. 

Comments and questions from S were 
ignored by E If S persisted in asking 
questions, E merely said: "You can play the 
game as long as you like. Téll me when you 
want to stop playing." This was generally 
sufficient. At the end of the session S was 
told that he would receive his toy after every- 
one in his room had played the game. 


RESULTS AND DISCUSSION 


The raw scores of number of re- 
sponses during extinction were trans- 
formed into log values to reduce 
heterogeneity of variance. Table 1 
presents the mean log number of 
responses during extinction for each of 
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TABLE 1 
MEAN Loc NUMBER or Responses 


T 

| S | Cue | S/Cue | Pr 

Light 119 | .80 | 94 | ii 
No light 98 -76 7S | 


the conditions. An analysis of vari- 
ance on the log values indicated a 
significant difference between groups, 
F (6, 133) = 8.15, 5 < .01. Subse- 
quent Duncan’s test for ranked means 
showed the P* continuous penny group 
producing significantly more lever- 
pulling responses than any extinction 
condition, p < .01. The S'/Light 
subgroup showed greater resistance to 
extinction than the Cue/Light, Cue/ 
No-Light, or S'/Cue/No-Light condi- 
tions, indicating Strong support for 
a reliable St effect. No other sta- 
tistically significant differences were 
found. 

For the St subgroups, classical sec- 
ondary reinforcement literature would 
lead us to expect more resistance to 
extinction for the Light condition. 
The nonsignificant difference between 
Light and No Light is difficult to 
explain. An inspection of the original 
raw score data showed a definite 
separation of the subgroups in the 
number of responses above and below 
the median, 8. A median test (Siegel, 
1956) showed these differences to be 
significant, p < .01, thus supplying 
additional support for the secondary 
reinforcement hypothesis. 

Generally, the results of the present 
study showed that: (a) the light 
served as an S when it was associated 
directly with the presentation of the 
Pr, (b) if the light was introduced only 
while the lever was being pulled down, 
ie. slightly prior to but not during 
the P* interval, the number of re- 
sponses during extinction T yes 
cantly fewer than resulted from the 
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simultaneous presentation of the 
and light, and (c) a combination of St 
and cue (a common practice in many 
Sr experiments) sustained lever-pull- 
ing behavior at a level somewhere 
between the pure S* and pure cue 
conditions. Unfortunately, published 
studies often fail to report the exact 
specifications of the temporal sequence 
of the light (or sound), although this 
appears to be an important determiner 
of cue and S effects. If an E intro- 
duces a stimulus and response strength 
increases or remains high, then the 
experimental procedure must show 
that this increase was not due to a cue 
effect before citing the result as evi- 
dence for secondary reinforcement. 
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TEMPORAL DETERMINANTS OF A 
KINESTHETIC AFTEREFFECT 


G. SINGER axp R. H. DAY 


University of Sydney 


The development of a kinesthetic aftereffect as a function of stimulation 
times of 15, 30, 45, 60, 75, 90, 105, and 120 sec. and its dissipation after 
30 and 90 sec. stimulation have been investigated in 2 experiments. 
The task required kinesthetic judgment of the horizontal following 
controlled rhythmic stimulation of the extended hand across a slanted 
bar. In Experiment I the aftereffect shown to increase with stimu- 
lation time. In Experiment II the size of the aftereffect was similar 
immediately following 30 and 90 sec. stimulation but the rate of dissipa- 
tion was greater for the shorter than for the longer stimulation. Since 
with the method of adjustment dissipation is rapid during the adjust- 
ment period, the development function of Experiment I is interpreted as 
representing a joint function of stimulation time and differential 


dissipation rates. 


Following appropriate stimulation 
of those sensory systems capable of 
Spatial discrimination, changes occur 
' judgments of size, shape, orienta- 
10n, Or position. These spatial after- 
etiects are now known to occur in 
Vision, hearing, kinesthesis, and in the 
Vestibular and tactile systems. In 
hi they have been extensively 

Udied as the figural aftereffect. 

4 Tom the viewpoint of a general 
icy it is of interest to establish the 
tional relationships between these 
td and intensity, spatial, and 
detec variables. The temporal 
tom minants of the spatial aftereffect 
me , kinesthetic stimulation have 
and. investigated by Bakan, Myers, 
~ Schoonard (1962), Bourne and 
“ler (1961), and Carlson (1963), all 
in hom used the now traditional task 
bar bere Judgments of the width of a 
on, efore and after stimulation by 
© of different width. 
T n an earlier study (Day & Singer, 
Consistently large (4-5°) and 
J'é kinesthetic aftereffects were 


d in a task requiring manual 


onents of the horizontal after 
tion of the extended arm and hand 


across a slanted bar. This task was 
suggested by the investigations of 
Gibson (1933) and actually used by 
Köhler and Dinnerstein (1947) who 
reported no quantitative data. So 
far little quantitative data are avail- 
able on the determinants of this par- 
ticular form of the kinesthetic after- 
effect. It is of interest, however, to 
establish the relationship between 
this easily measurable aftereffect and 
those variables which are known to 
affect other kinesthetic aftereffects. 
The aim of the experiments reported 
here was to establish the relationship 
between the magnitude of this kines- 
thetic aftereffect and time of stimula- 
tion on the one hand, and its dissipa- 
tion over time on the other, 


EXPERIMENT I 


The first experiment was concerned 
with the relationship between the size 
of the kinesthetic aftereffect and 
duration of stimulation, i.e., with the 
development of the effect with stimu- 
lation. In earlier experiments using 
the width judgment task (Bakan et 
al., 1962; Bourne & Beier, 1961; Carl- 
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son, 1963) three to five stimulation 
periods were used. Here eight dura- 
tions ranging from 15 sec. to 120 sec. 
were used in order to obtain as de- 
tailed a function as possible. 


Method 


Apparatus.—The apparatus was adapted 
from similar equipment used in an earlier 
investigation (Day & Singer, 1964). It con- 
sisted of a metal lumber frame 46 in. high, 
24 in. wide and deep in which was mounted a 
centrally pivoted 12-in. long bar in S’s frontal 
plane. The angle of bar could be adjusted 
by S or by E using suitably placed rotary 
controls operating through a 12:1 reduction 
gear. The angle of the bar could be read to 
the nearest .25° from an 8-in, diameter pro- 
tractor scale at the rear of the frame. 

Subjects.—Ten high school students (two 
boys, eight girls) whose ages ranged from 13 
to 17 yr. (median age 14 yr.) acted as Ss. No 
S was familiar with either the task or with 
kinesthetic aftereffects, 

Experimental procedure and design.—A trial 
consisted of a pretest, a stimulation period, 
and a posttest. In the pre- and posttests, .S, 
wearing opaque goggles, adjusted the bar with 
the left-hand control so that it seemed hori- 
zontal to the right hand moving at a preferred 
rate across its upper edge. To minimize the 
time taken to make an adjustment the bar 
was always set by E at the true horizontal 
prior to each pre- and posttest. Stimulation 
consisted of rhythmic side-to-side motion of 
the right hand across the upper edge of the 
bar slanted at 15? from the true horizontal. 
Hand motion was made in time with a 
metronome set at 84 beats per min. The 
difference between pre- and posttest adjust- 
ments in degrees was the measure of the after- 
effect. On completion of stimulation .S re- 
moved his hands from the apparatus while E 
quickly adjusted the bar to the physical 
horizontal. 

Following five practice adjustments and 
appropriate instructions each .S underwent 
two trials for each of eight stimulation periods 
of 15, 30, 45, 60, 75, 90, 105, and 120 sec. 
duration. Earlier observations had shown 
that motion of the extended hand across the 
slanted bar could not be sustained without 
discomfort for more than 2 min. In one trial 
of each pair the bar was slanted upward on S’s 
ight during stimulation and downward in the 
E the order of presentation of these 
e od alternating from S to S. The 
eight pairs of trials were presented in a differ- 
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ent random order for cach S with a 3-min. rest 
period between each trial. The 3-min. in- 
terval was chosen since preliminary observa- 
tions had shown that the effect had almost 
completely disappeared after this period. The 
possible occurrence of a cumulative aftereffect 
With repeated trials seems doubtful since the 
investigation of Heinemann (1961). The 
time taken by Ss to complete the posttest 
adjustment was recorded and found to have a 
mean of 17 sec. Each session lasted about 
90 min. 


Results 


The mean differences between pre- 
and posttests each based on two trials 
per 5 are shown for the eight stimula- 
tion periods in Table 1 along with 
their SDs. 

The relation between size of after- 
effect and stimulation time may be 
described as exponential. The func- 
tion y 244 B(1— c=)/(1 — C) 
describes a family of monotonic curves 
ranging from linear (in the limit) to 
curvilinear, A computer program! 
exists for finding by iterative means 
estimates of the parameters A, B, and 
C for that one of the family or curves 
Which is the best fitting line in the 
least-squares sense, Thus the pro- 
gram considers the linear fit as well as 
a family of curvilinear fits. 

In the above expression, x is an 
ordinal number listing equally spaced 
values of stimulation time 1,2.-- 
J--.1, y is the value of the after- 
effect at the jth value of stimulation 
time, A is the value of the aftereffect 
for the first stimulation time, B is the 
change in the aftereffect between the 
first and second stimulation times (15 
and 30 sec.), and C is the constant 
ratio of successive changes. 

The least-squares estimates of A, B, 
and C were found for each individual 


! The FORTRAN computer program for 
curve fitting was developed by J. P. Sutcliffe 
whose assistance in the treatment of these 
data and in the preparation of this report is 
gratefully acknowledged. 
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TABLE 1 
MEANS AND SDs OF DIFFERENCES BETWEEN PRETESTS AND 
POSTTESTS FOR THE Two EXPERIMENTS 
Exp. I 
Stimulation Times (Sec.) 
15 30 45 60 75 90 105 120 
X 1.80 1.76 2.22 2.60 2.72 2.52 3.04 3.11 
SD 1.29 1.58 1.47 1.71 1.44 2.22 2.14 2.15 
Exp, II 
Time after Cessation of Stimulation (Sec.) 
Cond. 
0 15 30 45 60 75 90 105 120 180 
PET 5 0.18 | +0.83 | —0.02 | +0.09 | +0.30 
X .01 0.59 | 4-0.19 | 4-0. s —0.02 j 2 
30 sec. | $5 | E29 109 | 132 | tose 0.98] 112| 108| 084! 13$ "os 
sec, | X : : 2.73 | +1.30| +1.78 | +1. 92 | 4-0. 64] —0. 
0 sec. itio Ties Taz) tiso 188| 186]  127| 132| 135| 200 


Sand for the mean data of the group. 
n the first case (mean parameter 
Curve) the general function for the 
average values of A, B, and C is 
Specified and, in the second case (mean 
Performance curve) the curve is based 
9n the 4, B, and C values established 
Tom the group means. N 
he mean performance curve is 
Shown in Fig. 1 as a continuous line. 
his plot suggests that after 2 min. 
Stimulation the maximum aftereffect 
ad not been reached. The mean 
Parameter curve is shown as a dotted 
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STIMULATION TIME (SEC) 
ti Fic. 1. Kinesthetic aftereffect as a fae 
m of stimulation time. (Continuous line: 

an performance curve; dotted line: mean 
Parameter curve.) 
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line in Fig. 1. Since not all Ss ex- 
hibited the regular pattern of in- 
crement in the aftereffect with in- 
creasing stimulation time, this second 
plot differs from the mean perform- 
ance plot. Both curves however, show 
an exponential increase in kinesthetic 
aftereffect as a function of stimulation 
time, the mean parameter curve tend- 
ing toward an asymptote at an earlier 
stage than the mean performance fit. 


EXPERIMENT II 


There is good reason for supposing 
that the development function in Exp. 
I represents an interaction between 
stimulation time and different dissipa- 
tion rates of the aftereffect for differ- 
ent stimulation times. Oyama (1953) 
in an extension of earlier experiments 
by Hammer (1949) showed that the 
dissipation rate of a visual-spatial 
aftereffect was relatively fast for 
short stimulation periods and slower 
for the longer stimulation durations. 
Since in Exp. I there was a mean time 
lapse of 17 sec. for the posttest, it is 
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possible that for the shorter stimula- 
tion periods especially, much of the 
effect would have dissipated by the 
completion of the adjustment. Less 
dissipation would be expected to occur 
for the longer stimulation periods. 

The aim of Exp. II was to establish 
the dissipation functions for the 
kinesthetic aftereffect for two stimula- 
tion times: 30 sec. and 90sec. If these 
functions varied in rate then it would 
be reasonable to assume that the 
curves of Fig. 1 are joint functions of 
development and dissipation, the lat- 
ter being an artifact of the adjustment 
method. 


Method 


Apparalus.—The same apparatus as in 
Exp. I was used. 

Subjects—There were two groups of 12 
adult male and female Ss all of whom were 
either graduate or senior students in the 
Department of Psychology. The Ss varied in 
their knowledge of spatial aftereffects but 
they had not previously performed the 
particular task. 

Experimental procedure and design.—Apart 
from stimulation times and the intervals 
elapsing between cessation of stimulation and 
posttest, all conditions were the same as in 
Exp. I including the alternation of directions 
of tilts and the 3-min. interval between trials, 
After cessation of stimulation S rested with 
hands on knees and, at"a"signal from E, the 
hands were replaced and the posttest adjust- 
ments made. In the 0-sec. condition the 
procedure was the same as in Exp. I. 

Each S underwent 10 randomized pairs of 
trials in two blocks of five with at least 1 day 
between blocks. The 10 intervals between 
cessation of stimulation and initiation of the 
posttest were 0, 15, 30, 45, 60, 75, 90, 105, 
120, and 180 sec. For half the Ss in each 
group this order was reversed. One group of 
12 Ss underwent 30 sec. stimulation and the 

other group 90 sec. A record was kept of the 
times taken to complete the posttest adjust- 


ment. These times had a mean of 12 sec. 
ment. 


Results 


In Table 1 are shown the mean 


-8g 1 
ifferences and SDs between pre- anc 
cdm for the 10 intervals between 


cessation of stimulation and com- 
mencement of posttest under the two 
stimulation conditions (30 sec. and 
90 sec.). Following the procedure 
adopted in Exp. I, two curves have 
been fitted to these data: one based 
on the mean performance and the 
other derived from the mean of the 
individual parameters. The mean 
performance curves for the two condi- 
tions are shown as continuous lines in 
Fig. 2 and mean parameter curves as 
dotted. 

Since the mean time taken for E 
to reset the bar and for S to make the 
adjustment was 12 sec., this time has 
been added to each of the intervals so 
that 0 sec. becomes 12 sec., 15 sec. 
becomes 27 sec., etc. It should be 
noted, however, that since the pro- 
gram used for fitting the curves re- 
quires equal intervals between values 
of the independent variable (i.e., be- 
tween delays from cessation of stim- 
ulation to posttest) the curve has not 
been fitted to the 180-sec. data point 
(i.e., 192-sec. plotted point in Fig. 2). 

Whereas there is little difference be- 
tween the two curves for the 90-sec. 
stimulation condition, for the 30-sec. 
condition the mean parameter curve 
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, Fic. 2, Kinesthetic aftereffect as a func- 
tion of time after cessation of stimulation 
Periods of 30 sec. and 90 sec. (Continuous 
lines: mean Performance curves; dotted lines: 
mean parameter curves. The computed 
points for intersection with the ordinate— 
Le., O-sec. delay—are also shown.) 
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tends to an asymptote at a higher 
value (about 0.6?) than the mean 
performance curve. This difference 
1s probably due to one S who exhibited 
a dissipation function opposite in 
direction to the other 11 Ss. 

Since the original 0-sec. condition 
has become 12 sec. because of the time 
Occupied by the posttest adjustment, 
it has been necessary to compute 
values for the true 0-sec. condition. 
These derivations are 5.3 for the 30- 
Sec. stimulation condition and 4.4 for 
the 90-sec. condition, and both points 
are shown in Fig. 2 on the ordinate. 


Discussion 


_ If the data of Exp. I and II are con- 
Sidered together it is reasonable to con- 
clude that the growth of the aftereffect 
as a function of stimulation time (Fig. 1) 
represents an interaction between the 
Possible incremental effects of stimula- 
tion time and the dissipation effects 
Occurring during the posttest adjustment. 
The method of adjustment necessarily 
p Volves a time lapse after stimulation. 
si E that period the aftereffects dis- 
i ate at a rate determined by stimula- 
jon duration (Fig. 2). Thus on com- 
Detion of the posttest, it would be 
Pater, that much of the effect would 
=e dissipated during the 17-sec. adjust- 
aiat Deriod. Further, since the rate of 
Sipation can be assumed to be greater 
tha the shorter periods of stimulation 
Be after 17 sec., the aftereffect would 
tl Smaller for the shorter duration times 
van for the longer. It can be assumed, 
dup that using a psychophysical 
Co Od with a shorter response latency 
fee oe stimuli or limits) the develop- 
injg,; ""ction would exhibit a steeper 
‘al rise and achieve a maximum after 
su Nefer stimulation period than that 
Sgested by Fig. 1. 
ie results from Exp. II show that the 
'al sizes of the aftereffect soon after 
dine and 90 sec, stimulation are similar, 
th In point of fact, while 
Sec. stimulation plot is for the 


m E 
95t part below that of the 90-sec. in 


Fig. 2, its initial computed magnitude is 
greater. Both sets of data conform 
closely to an exponential decay function. 

There is a striking resemblance be- 
tween the functions found here for a 
kinesthetic aftereffect and those reported 
by Oyama (1953) for the visual after- 
effect. In the latter experiments it was 
shown that following stimulation periods 
ranging from 1 sec. to 240 sec. the initial 
magnitudes of the effect were constant, 
but that dissipation was very rapid for 
the short periods and slow for the longer. 
It was also demonstrated by Oyama 
that, if the delay between stimulation 
and judgment was very short (1 sec.), 
the development function was rapid, 
achieving its maximum after about 25 
sec. If the interval was controlled at 5 
sec. however, the maximum effect did 
not occur until after 240 sec. stimulation. 

The exponential functions found by 
Oyama (1953) for the development and 
dissipation of a visual-spatial aftereffect 
and those reported here for the kines- 
thetic effect suggest that these two after- 
effects behave in a similar fashion as a 
function of stimulation time and time 
after stimulation. 

In conclusion, a point of general 
methodological interest can be raised. 
The mean adjustment times for the 
younger group of Ss in Exp. I was 17 sec., 
while that for the adult groups of Exp. II 
was 12 sec. The measured size of both 
visual and kinesthetic aftereffects using 
the method of adjustment is determined 
by both stimulation time and adjustment 
time. It is to be expected, therefore, 
that, if the adjustment times found here 
are a true indication, more of the effect 
will dissipate with younger than with 
older Ss during the posttest. This ex- 
pectation is critical for developmental and 
comparative studies of spatial aftereffects 
where differences in the size of the after- 
effects are often claimed. For example, 
the smaller visual effect found by 
Prysiazniuk and Kelm (1963) for re- 
tarded adults after 40 sec. stimulation as 
compared with the effect manifested by 
normal Ss could well be due to the 
different adjustment times for the two 
groups. 
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EFFECTS OF SHOCK INTENSITY AND PLACEMENT 
ON THE LEARNING OF A FOOD-REINFORCED 


BRIGHTNESS DISCRIMINATION ! 


ELIZABETH R. CURLIN asp JOHN W. DONAHOE 


University of Kentucky 


The effects of shock intensity (.13 ma. or .30 ma.) and shock placement 
(S+ or S—) on the learning of a brightness discrimination were in- 
vestigated. 30 albino rats served as Ss in a bar-pressing task. Low- 
intensity shock (.13 ma.) facilitated the rate of discrimination formation 
whether placed in S+ or 5—. ‘The omission of .13 ma. shock after the 
attainment of a stable discrimination left performance relatively un- 
changed. Moderate intensity shock (.30 ma.) facilitated discrimination 
performance when placed in S— and hindered performance when placed 
in S+. The omission of .30 ma. shock produced an immediate de- 
terioration in the performance for those Ss shocked in S— and an im- 
provement in performance for those Ss shocked in S+. It was con- 
cluded that low-intensity shock acts primarily as an additional stimulus 
to facilitate discrimination learning, whereas moderate intensity shock 


acts primarily as an 
formance. 


In the present experiment, a bar- 
Pressing task was used to assess the 
effects of two intensities of shock on 

e learning of a food-reinforced 
rightness discrimination. A response 
in the positive stimulus (S+) pro- 
“uced shock for half the animals; a 
response in the negative stimulus 
(S—) produced shock for the remain- 
ng half. It was hypothesized that 
OW-intensity shock serves predomi- 
nantly as a discriminative stimulus 
Cue function) whereas high-intensity 
Shock serves predominantly as an 
aversive stimulus (motivational func- 
tion). This hypothesis is a particular- 
ization of Miller and Dollard’s (1941) 
tatement that, “The stronger the 
Stimulus, the more drive function it 
Dossesses [p. 18]" At low intensities 

shock, discrimination formation 
Should be facilitated as the shock 

The re orted by grants to 
the Second Mis e ate Faculty Research 
Pubi; University of Kentucky, and from the 

ublic Health Service (MH-07332-01). The 
s Computing 


thot University of Kentucky, for the use of 


aversive (motivational) stimulus to affect per- 


stimulus (whether it occurs in S4- or 
S—) increases the difference between 
the stimulus conditions prevailing 
during the reinforcement and the non- 
reinforcement of the bar-pressing re- 
sponse. At higher shock intensities, 
however, the aversive properties of 
shock should decrease the frequency 
of those responses which produce it 
and thereby facilitate discrimination 
formation only when placed in S— 
and hinder discrimination formation 
when placed in S+. 


METHOD 


Subjects —Thirty male albino rats (Wistar 
strain, Harlan Industries), 80-90 days old, 
served as Ss. One week before training all 
animals were reduced to 80% of normal body 
weight and were maintained at this level for 
the duration of the study. All animals were 
fed a measured daily ration 1 hr. after the 
completion of each experimental session. 
Throughout the experiment, Ss were housed 
in individual living cages in a temperature- 
controlled room. 

Apparatus.—Two Skinner boxes of iden- 
tical construction were used. Each box was 
enclosed in a sound-deadened hull with a 
blower to furnish ventilation and a masking 
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noise. A Gerbrands response bar was cen- 
trally located in one wall of each chamber 
with a food cup 1 in. to the left and a drinking 
tube 1 in. to the right of the bar. A pilot 
lamp with a 4-in. white jewel was mounted 
2$ in. above the bar and served as the 
discriminative stimulus. The chamber was 
dimly illuminated at all times by a houselight. 
A constant current shock supply with a 
scrambler (Grason-Stadler E1064GS) was 
used to electrify the grid floors, walls, bar, 
food cup, and drinking tube. The experi- 
mental events were programed by means of 
standard relay and timing equipment and the 
responses were monitored by printing counters. 
Procedure.—All Ss received a pretraining 
period which consisted of three sessions of 
continuous reinforcement. During the first 
session, bar pressing was acquired through the 
method of approximations with 10-15 gratui- 
tous pellets being delivered to each animal. 
The first session was terminated at the end of 
1 hr. or after 120 reinforced responses, which- 
ever occurred first. For the remaining two 
sessions of pretraining, a 45-mg. food pellet 
was placed in the food cup at the beginning of 
each session and bar pressing was continuously 
reinforced for 15 min. During pretraining, 
the houselight Provided the sole source of 
illumination. 
Discrimination training was then begun 
with continuous reinforcement scheduled 
when the pilot lamp was on (S+) and extinc- 
tion scheduled when the lamp was off (S—). 
The various treatment conditions were formed 
by randomly subdividing Ss into the following 
five equal groups: (a) no shock in either S+ 
or S— (a no-shock control group, Group C), 
(b) .13 ma. shock in S+ (13 S+ group), (c) 
.13 ma. shock in S— (.13 S— group), (d) .30 
ma. shock in S4- (.30 S4- group), and (e) .30 
ma. shock in S— (.30 S— group). For the 
four experimental groups, the shock stimulus 
was .50 sec. in duration and was administered 
immediately after the depression of the bar. 
Of the six animals in each group, three were 
run in each of the two Skinner boxes. In 
summary, the experimental plan consisted of 
a 2X2 factorial design having shock in- 
tensity (.13 ma. or .30 ma.) and shock place- 
ment (S4- or S—) as variables of classification 
with the addition of a single no-shock control 
er sessions of discrimination training 
der the appropriate shock conditions were 
ups sion was composed of 12 dis- 
pah 5 os ch cycle consisting of 
crimination cycles, ea M (pie <i 
i followed by 120 sec. in S—. 
120 sec. in S+ he shock in- 
* Il experimental groups the shoc 
: mele was gradually increased to the appro- 
nsi a X E a. 
poate value. For the low-shock (.13 ma.) 
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groups, shock intensity was .08 ma. for Cycles 
1-6 of the first session and remained constant 
at .13 ma. thereafter. For the higher shock 
(.30 ma.) groups, the shock intensity was .08 
ma. for Cycles 1-6 and .13 ma. for Cycles 7-12 
of the first session, .13 ma. for Cycles 1-6 
and .25 ma. for Cycles 7-12 of the second 
session, and .25 ma. for Cycles 1-6 of the 
third session with the intensity remaining 
constant at .30 ma. thereafter. The gradual 
increase in shock intensity avoided the com- 
plete suppression of bar pressing occasionally 
Produced by an initial shock of .30 ma. and 
provided additional data on the effects of low- 
intensity shock in the early stages of dis- 
crimination learning. Thus the shock inten- 
sity was the same for the four experimental 
groups during the initial 18 cycles of training. 
After eight sessions of discrimination 
training, all Ss were given an additional two 
sessions in which shock was omitted. The 
changes observed in postshock performance 
were intended to provide information concern- 
ing the type of control exercised by shock 
stimulation during the first eight sessions. 


RESULTS 


Performance was measured by two 
dependent variables: (a) the propor- 
tion of the total responses occurring 
to S+ and (b) the number of responses 
occurring to S+ and S=. The S+ 
proportion provided a summary sta- 
tistic which indicated the level of 
discrimination directly while the num- 
ber of S+ and S— responses is the 
primary data which determine the 
level of discrimination. Unless other- 
wise indicated, data from the first two 
discrimination Cycles were excluded 
from the following analyses. This 
was necessitated as the .13-ma. and 
control groups consistently improved 
1n performance within all training ses- 
sions whereas a “warm-up” effect of 
comparable magnitude did not occur 
with the -30-ma. groups. The first 
two cycles of each session were there- 
fore eliminated to minimize the con- 
founding of intergroup differences 
with unequal warm-up rates. 

Effect of .13-ma. shock.—During the 
first 18 cycles of discrimination train- 
ing when all experimental groups were 


EFFECTS OF SHOCK INTENSITY AND LOCATION 


receiving a maximum shock intensity 
of -13 ma., the proportion of responses 
in S+ was greater for the experi- 
mental groups than for the control 
group, ¢ (28) = 2.58, p < .02. Con- 
sidering the performance of the 
-13-ma. groups across all training ses- 
sions, low-intensity shock facilitated 
the rate of discrimination formation 
whether placed in S+ or S— (see 
Fig. 1). The rate of change in the S+ 
Proportion, as measured by least- 
Squares estimates of the slope param- 
eter? indicated that the .13-ma. 
groups acquired the discrimination 
more rapidly than the control group, 
t (20) = 2.51, p «.05. The more 

* In order to secure a single measure of the 
rate of discrimination formation, the S+ 
Proportion and session number underwent a 
reciprocal transformation appropriate to the 
mo „form of the hyperbolic function, 

7 X/(a--bX). For all groups having 
Positive slopes (.13 S+, .13 S—, 30 S—, and 

), the median index of correlation was .923 


br noces c rat 
' individual animal curves and .977 for 
group curves, 
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rapid increase in the S+ proportion 
for the low-shock groups was primarily 
due to an abrupt decrease in the fre- 
quency of S— responding (see Fig. 2, 
Panel A). Whereas the frequency of 
S— responding decreased significantly 
between Sessions 1 and 2 in the .13 
S+ and .13 S— groups, ¢ (175) 
= 5.89, p < .001 and ¢ (175) = 3.89, 
p < .001, respectively, the compar- 
able difference was not reliable in 
Group C, # (175) = 1.54, p < .20. 
The number of responses in S+ tended 
to be greater than the control group 
in the .13 S— group and less in the .13 
S+ group although neither difference 
was significant. The asymptotic level 
of discrimination, as measured by the 
S+ proportion during Session 8, was 
the same for Groups .13 S+, .13 S-, 
and C. 

In the first no-shock session (Ses- 
sion 9), the S+ proportion remained 
unchanged for the low-shock and 
control groups (see Fig. 1) The fre- 
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S* PROPORTION 


SESSIONS 
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Fic. 1. Proportion 


of responses in the positive stimulus (S+) during shock 


sessions (1-8) and during no-shock sessions (9 and 10). 
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Number of Tesponses in the positive stimulus (S 
during shock sessions (1-8) and durin 


S+ responding; the broken lin 
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SESSIONS 


+) and negative stimulus (S—) 
g no-shock sessions (9 and 10). (The solid line represents 
e represents S— responding.) 


CYCLES 


oportion of responses in the positive stimulus (S+) during the 
indio Ly wires cycles of the first no-shock session (9). 


EFFECTS OF SHOCK INTENSITY AND LOCATION 


quency of responding was constant in 
Groups C and .13 S+, but increased 
in the .13 S— group particularly in 
S—,1(167) = 3.22, p < .01. Figure 
3, in which the S+ proportion is 
plotted for each of the 12 discrimina- 
tion cycles within Session 9, indicates 
that the .13 S+ and control groups 
performed at a similar level whereas 
the .13 S— group remained non- 
significantly below them. By the 
Second no-shock session (Session 10), 
the S+ proportion had fallen signifi- 
cantly below the level of Session 8 for 
the .13 S— group, ¢ (50) = 3.29, 
P «.01. As shown in Fig. 2, the 
decrease in the S+ proportion for the 
:13 S— group was due to an increase 
In S— responding. The .13 Sc 
8roup, by contrast, displayed an in- 
Crease in the S+ proportion in 
the second postshock session, ! (50) 
= 2.47, p < 02. 

Effects of .30-ma. shock—The .30 
S+ group showed an initial increase 
im the S+ proportion followed by a 

€crease as the shock intensity reached 
30 ma, (see Fig. 1). This trend, 
Which was unlike that found in any 
ther group, caused a significant 
Cubic component, F (4, 171) = 7.28, 

.€ .001, in the interaction of groups 
With sessions (see Table 1). The .30 
— group consistently produced a 
high Proportion of responses in S+. 

7e rate at which the discrimination 
developed in the .30 S— group did not 

ler, however, from the control 
SrOup as measured by the slope esti- 
Mates, 1 (20) = 1.33, p «20. As 
Shown in Fig, 2, Panel B, the initial 
elfect of increasing the shock intensity 
in the .30-ma. groups was to decrease 
the frequency of responding in both 
S+ and S— regardless of the stimulus 
in which shock occurred. The de- 
Crease in response frequency between 
essions 1 and 3 was significant for the 


30 S+ and .30 S— groups in both 
S+ and S— (p « .05 in every case). 
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TABLE 1 


ANALYSIS OF VARIANCE OF THE PROPORTION 
or RESPONSES IN S++ DURING 
DISCRIMINATION TRAINING 


Source df Fa 
Between animals 29 
Control group vs. exp. 
groups 1 0.33 
Shock level (L) 1 6.86* 
Shock placement (P) 1 |58417*** 
LXP 1 |63.44*** 
Animals within groups 
25 | (0.036) 
Within animals 206 ^ 
Sessions (S) 7 |18.50*** 
S X Groups 28 8.58*** 
linear 4 |44.75*** 
quadratic 4 | 2.26 
cubic 4 nes 
residual 16 1.51 
S XA 171 | (0.012) 


a Values in parentheses are MSs. 

b Due to failure of the printing counter, 4 df have 
been lost for estimation of the missing data. 

* This source of variance is a pooled error term 
following the satisfaction of F tests for homogeneity of 
the various polynomial components (linear, quadratic, 


tc.). 
"P 
++p «€ .001. 


As the discrimination sessions con- 
tinued, the S+ proportion for the .30 
S+ group remained significantly be- 
low the control group, t (25) = 4.32, 
p « .001, and the S+ proportion for 
the .30 S— group remained signifi- 
cantly above the control group, £ (25) 
= 2.08, p < .05. The difference in 
S+ proportion found between the .30- 
ma. groups, but not between the .13- 
ma. groups, gave rise to a significant 
interaction of shock placement with 
shock intensity, F (1,25) = 63.44, 
p < .001 (see Table 1). The changes 
in response frequency were also differ- 
ent as a function of shock intensity. 
In the .30 S+ group, responding in 
S+ remained at a low level through- 
out training, but responding in S— 
increased until it was significantly 
above the S— response frequency of 
the control group at the eighth 
session, £ (25) = 2.46, p < .03. Thus 
the S+ proportion of the .30 S+ 
group was depressed not only because 
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of a low response rate in S+, but also 
because of a relatively high response 
rate in S—. Inthe .30 S— group, the 
frequency of responding in S+ grad- 
ually increased after the initial decline 
but remained below the S+ frequency 
of Group C, # (25) = 3.67, p < .01. 
The S— frequency remained at a very 
low level throughout the shock ses- 
sions. A .30-ma. shock, therefore, 
produced different effects during dis- 
crimination training than a .13-ma. 
shock. Responding under .30-ma. 
shock was generally less frequent, 
F (1, 25) = 30.24, p < .001, and dis- 
played a different trend in S+ and 
S— over sessions, F (28, 169) = 6.88, 
p < .001 (see Table 2). 

The postshock behavior of the 
.30-ma. groups also differed from the 
.13-ma. groups. On the first post- 
shock session, the S+ proportion of 
the .30 S+ group increased, £ (25) 
= 7.28, p < .001 and the S+ propor- 


TABLE 2 


ANALYSIS OF VARIANCE OF THE FREQUENCY OF 
RESPONDING IN S++ AND S— DURING 
DISCRIMINATION TRAINING 


Source df Fe 
Between animals 29 
Control group vs. exp. 
groups 1 | 34.21*** 
Shock level (L) 1 |3024*** 
Shock placement (P) 1 1.76 
DX IE cun 1 | 3.36 
Animals within groups 
PS. 25 | (7,417.) 
Within animals 442b 
Sessions (S) 7 | 10.19*** 
S X Groups 28 2.7599 
SxXA ban ees 175 | (1,239.) 
S+ vs. S— discrimina- 
tion (D) E pcm 
Groups 11.42*9* 
D X A 25 | (4,809.) 
SxD 7 |24.89*** 
S x D X Groups 28 6.88*** 
SXDXA 167 | (921.) 


a Va in parentheses are MSs. 
b Values failure of the printing counter, 8 d f have 
been lost for estimation of the missing data. 
sen los 
pect p < .001. 
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tion of the .30 S— group decreased, 
t (25) = 6.69, p < .001 (see Fig. 1). 
The abruptness of these changes in 
S+ proportion are shown in Fig. 3. 
The improvement of performance in 
the .30 S+ group and the deteriora- 
tion of performance in the .30 S— 
group both became more pronounced 
on the second postshock day. The 
changes in responding which produced 
the alteration in the S+ proportion 
are shown in Fig. 2. The removal of 
shock in the .30 S+ group brought 
about a sharp increase in both S+ 
and S— responding, ¢ (167) = 10.28, 
P < .001 and t (167) = 5.52, p < .001, 
respectively. On the second post- 
shock day, S— responding declined, 
t (167) = 6.64, p < .001. In the .30 
S— group, the removal of shock was 
followed by an increased response 
frequency in S—, 1 (167) = 3.78, 
P «.001. The increase in S— re- 
sponding continued unabated during 
Session 10, ¢ (167) = 5.03, p < .001. 


DiscussioN 


A shock of .13-ma. intensity, whether 
placed in S+ or S—, produced a more 
rapid increase in the S+ proportion than 
found in the no-shock control group. In 
both .13-ma. groups, the increased rate 
of discrimination formation was due to 
an abrupt initial decline in the frequency 
of S— responding. This finding is inter- 
preted as supporting the hypothesis that 
low-intensity shock stimulation facili- 
tated the learning of the discrimination 
by increasing the difference between the 
stimulus conditions present during the 
reinforcement and nonreinforcement of 
the bar-pressing response. That is, shock 
Served as another cue, in addition to light 
onset and the aftereffects of reinforce- 
ment, which became associated with bar 
pressing through differential reinforce- 
ment. Viewed in this way, the shock 
stimulus facilitated discrimination learn- 
ing through the mechanism of habit 
summation previously documented in 


EFFECTS OF SHOCK INTENSITY AND LOCATION 


the T maze (Eninger, 1952). The ability 
of shock per se to serve as a discrimina- 
tive stimulus controling the rate of 
responding has been demonstrated by 
Holz and Azrin (1961). Pigeons were 
trained to peck a key under conditions in 
Which a 2-min. variable-interval rein- 
forcement schedule alternated with ex- 
tinction periods. Responding during the 
variable-interval schedule produced both 
food and shock. In extinction sessions, 
response-produced shock increased the 
frequency of pecking above the no-shock 
periods. 

Even at an intensity of .13 ma., there 
Was some tendency for shock to be 
aversive. In the .13 S+ group, S+ 
responding remained nonsignificantly be- 
low control levels during shock sessions 
and, in both .13-ma. groups, S+ and S- 
responding increased during postshock 
Sessions. The postshock S+ proportion 
and response frequency remained more 
Stable, however, than with .30-ma. shock. 

he most pronounced increase in post- 
Shock responding—that occurring in s- 
in the 13 S+ group—cannot be un- 
quivocally interpreted as the removal of 
the shock stimulus could produce an 
increase in responding due to both the 
elimination of aversive stimulation and 

€ institution of stimulus conditions 
More similar to those previously present 
in S+ (ie, no shock). Of the two 
factors, the cue variable is probably the 
More important as S— responding was 
Not depressed below control levels during 
the last four shock sessions in the -13 S= 
Sroup. It should also be noted that 
oth cue and motivational factors would 
avor an increase in response frequency 
only in the S— condition of the .13 5— 
Sroup, 

The effects of .30-ma. shock on per- 
formance were quite different from those 
Obtained with .13 ma. The initial 
response to .30-ma. shock wasa decreased 
frequency of responding to both S+ and 

In the .30 S+ group, S+ respond- 
Mg remained at a low level throughout 
the shock sessions while S— responding 
Sradually increased to a frequency above 
Control levels, The reduction in S+ 
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responding was clearly a result of the 
aversive nature of .30-ma. shock as 
responding rapidly rose to control levels 
on the first postshock day. The increase 
in S— responding in the .30 S+ group 
may be interpreted as an example of dis- 
placement (Miller, 1948, 1959). Thus 
S— responding, which was initially 
produced via generalization from S+ and 
pretraining, was thereafter maintained 
by fear reduction which occurred when 
bar pressing in S— was not followed by 
shock. In support of this interpretation, 
the frequency of S— responding in the 
.30 S+ group rose sharply in the first 
postshock session and then declined 
rapidly with continued exposure to no- 
shock conditions. 

The .30 S— group attained a higher 
level of discrimination than the control 
group although the rate of approach to 
the asymptotic value of the S+ propor- 
tion was not more rapid. As further 
evidence for the predominance of aver- 
sion with .30-ma. shock, the superior 
performance of the .30 S— group was 
achieved through a marked depression 
in the frequency of S— responding. 
Shock performance of both .30-ma. 
groups was therefore consistent in in- 
dicating that response frequency covaried 
with the stimulus in which shock oc- 
curred rather than the stimulus in which 
food occurred. In postshock sessions, 
the frequency of S— responding in- 
creased rapidly in the .30 S— group. As 
with the postshock responding of the .13 
S— group, a cue factor could have con- 
tributed to the increase in S— responding 
although a motivational factor provides 
the more likely interpretation in the .30 
S— group because of the suppression of 
S— responding during shock sessions. 
The finding that punishment failed to 
permanently suppress nonextinguished 
S— responding is consistent with the 
work of Skinner (1938) and Estes (1944). 
In summary, .30-ma. shock in S— is 
most readily interpreted as having pre- 
dominantly aversive properties which 
produced a greater proportion of re- 
sponses in S+ through punishment of 
S— responding rather than through 
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facilitation of the learning of the bright- 
ness discrimination. 

The finding that a low-intensity shock 
(.13 ma.) in either S+ or S— facilitated 
discrimination learning whereas a mod- 
erate intensity shock (.30 ma.) facilitated 
performance when placed in S— and 
hindered performance when placed in S+ 
is relevant to the Muenzinger-Wischner 
controversy concerning the “shock-right” 
effect. Although differences in procedure 
(Muenzinger and Wischner used modified 
T mazes) make detailed comparisons 
difficult, both sets of experiments suggest 
that shock intensity is a critical param- 
eter which determines whether shock in 
S+ will facilitate or hinder discrimina- 
tion performance. Muenzinger (1934; 
Muenzinger, Bernstone, & Richards, 
1938) has found a facilitating effect of 
shock in S+ and has always employed a 
shock of low intensity (.10-.15 ma.).3 
Moreover, he has reported that facilita- 
tion is more pronounced if the aversive 
effects of shock are reduced through a 
pretraining adaptation procedure (Muen- 
zinger, Brown, Crow, & Powloski, 1952). 
Wischner (1947), who found that shock 
in S+ hindered discrimination, used a 
shock of moderate intensity—.30 ma. 
In a later study (Wischner, Fowler, & 
Kushnick, 1963) in which shock intensity 
was varied, Wischner reported that dis- 
crimination was hindered Progressively 
as shock intensity increased from .15 to 
-25 ma. The import of all investigations 
is therefore consistent in indicating that 
the facilitation of performance by shock 
in S+ is confined to low-intensity shock 
having minimal or no aversive Properties. 
The fact that Muenzinger employed a 
correction method and Wischner a non- 
correction method is also an important 
feature contributing to the differences in 
their results and has been fully discussed 
by both authors (Muenzinger & Pow- 
loski, 1951; Wischner, 1947), 


3 To facilitate a comparison of the various 
experiments, all shock intensities have been 
transformed to units of current (ma.) where 
necessary by assuming an animal resistance 


of 300,000 ohms. 
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Generalization of an instrumental response in the rat to stimuli varying 
in the size dimension was investigated under 12 and 48 hr. of food 


deprivation. 


Generalization gradients for the 48-hr. condition were 


higher and of steeper slope than those for 12 hr. The data conform 
rather precisely to the implications of a multiplicative theory of habit 
and drive. The group means of 2 response measures, speed of the Ist 
test response, and number of extinction responses to the test stimuli, 
were linearly related with a correlation of .99. Implications of such 
findings for a reaction-potential concept are discussed. 


It appears to be a rather general 
finding that the amount of stimulus 
Seneralization increases with increas- 
ing levels of drive. The situation con- 
cerning the slope of generalization 
functions as a function of motivational 
Variables is somewhat less clear. 

Tull’s multiplicative theory of habit 
and drive clearly predicts that these 
Slopes should be steeper with higher 

tive levels under circumstances in 
Which the response measure is linearly 
Telated to excitatory potential and 
Ceiling effects are not involved. This 
1S in direct opposition to the fre- 
quently expressed, commonsense view 
that high drive impairs discrimination 
and, hence, might be expected to 

atten generalization gradients. The 
Present experiment is an investigation 
9f the effects upon size generalization, 
M the rat, of variations in food dep- 
“vation. The experiment was con- 

Ucted in the situation previously 
used in generalization studies by Grice 
and Saltz (1950), Kling (1952), and 
‘Da i i aper were 
collected, in s aint in ia PhD thesis by 
Th, first author at the University of Naer 
C. i present report was supported E em 
tion 23 from the National Science Fou! : 

„and MH-08033-01 from the Nationa 
‘stitutes of Health to the second author. 


Margolius (1955). Response speed 
and resistance to extinction are pre- 
sented as two measures of response 
strength, and the relations between 
these two measures are reported. 


METHOD 


Subjects —The Ss were 120 experimentally 
naive albino rats of Sprague-Dawley strain. 
They ranged in age from 80 to 103 days at the 
start of the experiment. There were 65 
females and 55 males divided as evenly as 
possible among the experimental groups. 
Nine additional Ss were eliminated from the 
experiment: 4 for failure during pretraining, 
and 5 for illness or death. 

Apparatus.—The apparatus has been de- 
scribed in detail by Grice (1949). In brief, it 
consisted of a 2-ft. alley with a vertical sliding 
door in the middle. The stimulus objects 
were exposed at one end, and were mounted 
at the front of a 2-in. extension of the alley. 
The alley was pivoted at the center, so that 
each end became alternately the start box 
and the response compartment. The stimuli 
were white, circular, metal disks with closely 
fitting, square swinging doors in the center. 
The rat obtained a food pellet from a small 
tray attached to the back of the disk, by 
nosing open the door. The disks employed 
were 79, 50, 32, and 20 cm.? in area, represent- 
ing equal steps of log area. 

Training procedure.—The training pro- 
cedure has also been described in detail else- 
where (Grice & Saltz, 1950). On the first 
day, Ss were allowed to eat 20 pellets from the 
food tray with the swinging panel completely 
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open. On the second day, they received 10 
trials, during which they were trained to push 
the panel by an approximation procedure. 
On each of the next 3 days, Ss received 20 
training trials with the panel closed. La- 
tencies were recorded from the raising of the 
start door at the center of the alley to the 
moving of the swinging panel by the animal. 
The 79-cm.? disk was used as the training 
stimulus for all animals. 

During training, the hunger drive was 
produced by approximately 234 hr. of food 
deprivation. Immediately following the 
training session each S received a 3-hr. feeding 
period, receiving Purina laboratory chow in 
an individual feeding cage. All Ss were 
adapted to this schedule for 6 days prior to 
the start of the experiment. Food pellets 
used as rewards weighed approximately 
.05 gm. 

Generalization lests.—Generalization tests 
were conducted 48 hr. following the third 
acquisition session, each animal being run 
at approximately the same time of day as in 
original training. Half of the Ss were tested 
under 48 hr. of food deprivation and half 
under 12 hr. Each S received a 1-hr. feeding 
at the appropriate time before testing. This 
feeding included wet mash as well as the 
usual pellets. 

The actual generalization testing was as 
described by Grice and Saltz (1950). Follow- 
ing 5 additional training trials to the 
original training stimulus 35 extinction trials 
were conducted with the test stimulus. If Ss 
failed to respond in 30 sec., the trial was 
terminated and the next trial was run. Sepa- 
rate groups were run with each of the four 
stimuli under each drive level. There were 15 
Ss in each of the eight experimental groups. 

Response measures.—Two measures of re- 
sponse strength to the test stimuli are pre- 
sented. Resistance to extinction consists of 
the number of panel pushes during the 35 
extinction trials. The other measure is speed 
of the first response—1,000/Latency. If Ss 
failed to respond on the first test trial, time 
was cumulated over one or more additional 
trials until the first response. This manner of 
scoring latency 18 somewhat unusual, but it 
has previously been used successfully by Grice 
(1956). Following the reasoning of Estes 
(1950) and Spence (1954) in which latency is 
deduced as a function of momentary response 

bability, it should provide as good an 
pe i f -eaction potential as the more con- 
pe Fal riethod. It also has the important 
Y adasi of permitting the obtaining of the 


extinction measure. 
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Fic. 1. Generalization gradients based 0” 


speed of the first-test response. 


RESULTS 


The mean speeds of the first test- 
trial response are presented in Fig. 1, 
and the mean numbers of responses 17 
extinction to the test stimuli are 
presented in Fig. 2. The smooth 
curves drawn through the points an¢ 
the transformation of Fig. 2 are dis- 
cussed below. Analyses of variance 
of these data are presented in Table 1. 
The analyses generally support con- 
clusions drawn from an inspection 9 
the means. The greater levels O 
response strength with the 48-hr. 
deprivation are supported by the 5187 
nificant effects for Drive Level. The 
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Vic. 2. Generalization gradients based on 
number of responses in 35 extinction trials to 
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TABLE 1 


ANALYSES OF GENERALIZATION Test DATA 


F 
Source df 
Speed Extinction 
Drive (D) 1 26.62*** 255°" 
Stimuli (S) 3 61.81*** 19.83*** 
Stimuli within 48 hr. (3) m PE 13.53 
Linear trend (1) p 22.58*** 37.83*** 
Wide So (84 3072) n 
Jithin stimuli S 1807.2)" 23.1)" 
Stimuli within 12 hr. (3) 8.69777 T0570 
Linear trend (1) 25.68 20.57 
Residual (2) T 0.19 029 
Within stimuli (56) (17,518.3)* (14.2) 
DXS 3 1.83 2.53 
Dif erengs in linear trend [ol a TT 
esidual 2 2 ` 
Residual withi ; 116 (50,288.0)* (18.6)* 
Within uas ithin drive qua (51:162.7)* (18.6) 
l'otal 119 
py ror. 
b < O01. 


Significant Stimulus effects and the 
significant linear trends indicate that 
generalization decrement occurred un- 
der both levels of drive. The signifi- 
cant effects for Difference in Linear 

rend suggest that the apparently 
Steeper slope for the high-drive condi- 
tion is genuine. The conclusions are 
essentially the same for both response 
measures, 


DıscussIoN 


Generalization and drive strength.—The 
results of this experiment agree with the 
Implications of the multiplicative theory 
of habit and drive that generalization 
gradients should be both higher and 
Steeper under high-drive conditions than 
^t lower levels of drive. In view of the 
oe that response speed is generally 
egarded to be linearly related to reaction 
Potential (E), a further analysis of these 

ata has been made to investigate the 
j; rek to which the multiplicative theory 
fon PPorted. Assuming an exponential 
See the gradient, the following ex- 
Ssion may be written: 


E = D(H x 107) 


E indicates excitatory potential; D, 
drive; H, habit strength to the training 
stimulus; d, the number of logarithmic 
units between the training and test 
stimuli; and b is an empirical constant. 
Letting D equal unity for the 48-hr. con- 
dition, the following equation was fitted 
to these data: 


Ea = 658 X 1070299 


This equation is plotted as the upper 
curve in Fig. 1. Under the multiplicative 
theory, the 12-hr. data should be ap- 
proximated by multiplying this expres- 
sion by a lower value for D, less than 
unity. Such a parameter was estimated 
from the 12-hr. data, and the lower 
curve in Fig. 1 represents the above 
equation multiplied by the lower value, 
0.43. The new equation becomes: 


Ex. = 283 X 1070-1984 


The fit to the 12-hr. data is good and 
supports the implication that the only 
difference between the two functions i: 
difference in a multiplicative constan 
dependent upon drive level. 

Including the present experiment 
there now exists considerable support fo 
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the implication of the multiplicative 
theory that generalization gradient slope 
and drive level should be positively 
related. In addition to this experiment, 
the prediction is supported by the Porter 
(1962) experiment, and by the unpub- 
lished? absolute generalization data of 
Jenkins, Pascal, and Walker (1958). As 
was pointed out by Porter, data for three 
of the four drive levels employed by 
Thomas and King (1959) also conform 
to the prediction. The discrepant group 
in that study was that with the highest 
drive level used, a group of pigeons 
reduced to 60% of initial body weight. 
One interesting fact about the data for 
this group, is that the origin of the gra- 
dient at the training stimulus is actually 
below that for the next lowest drive level, 
a 70% group. This suggests the possible 
interpretation that this group may have 
been suffering from an inanition factor 
which depressed the maximum response 
rate. This would account for the flatter 
function obtained for this condition on 
the basis of a ceiling effect, imposed by 
the inanition factor, depressing the 
central region of the gradient. It is also 
possible that the difference in the slopes 
of the relative generalization functions 
obtained by Jenkins, Pascal, and Walker 
could be explained in the same manner, 
Adequate evaluation of this theory is not 
an uncomplicated matter, For example, 
it should be remembered that Hull (1951) 
did not assume that response potentia- 
tion was a monotonic function of dep- 
rivation conditions. Before attempting 
to investigate the quantitative effects 
upon generalization functions, one should 
be aware of the relations between the 
response measures used and the motiva- 
tional variable for ungeneralized re. 
sponses. It must also be recognized that 
the role of such factors as inanition may 
be expected to interact with such vari- 
ables as species, strain, age, maintenance 
conditions, and condition of health of Ss, 

There appears to be little support of 
the view, based on the assumption of 
impaired discrimination under high drive, 


? W. O. Jenkins, personal communication, 
1962. 


of an inverse relation between general- 
ization gradient slope and drive level. 
However, it scems clear that such results 
can be obtained under conditions in 
Which the maximum levels of the re- 
sponse measures are approached. Such 
maxima may either be inherent in the 
measures, or they may be introduced by 
experimental conditions which impair S's 
ability to respond. Perhaps flattening of 
gradients may also be obtained in cases 
of sensitization or pseudoconditioning, 
Where response evocation may be non- 
associatively determined. 

Relations between the response measures. 
—One result of some interest in the 
present experiment is the close agreement 
between the two response measures used 
in the generalization tests. There have 
been occasional references to rather low 
correlations between various measures of 
response strength; e.g., Kimble (1961). 
Sometimes such results are regarded as 
an argument against the utility of an 
excitatory or reaction-potential concept. 
Following a Procedure previously re- 
ported by Grice (1956), the correlation 
between the present two measures has 
been analyzed into within-conditions an 
between-conditions components. I 
obtained correlations between speed e 
the first response and responses in ex- 
tinction were .22 for within conditions, 
but .99 for between conditions. This 
latter value is, of course, the correlation 
between the group means. The between- 
conditions correlation was also .99 be- 
tween speed of the first response and the 
percentage of responses on trials remain- 
ing after the first response. The smooth 
curves drawn through the extinction 
Points of Fig. 2 are not fitted to the data, 
but are actually the same curves from the 
analysis of the speed data. The linear 
transformation of the ordinate is the 
linear regression equation. 

Correlations between response meas- 
ures usually reported are based on 2 
group of Ss, similarly treated, and are 
analogous to the present within-condi- 
tions correlation, A possible interpreta- 
tion is that when a group of Ss is 
subjected to extended training, system- 
atic individual differences in reaction 


STIMULUS GENERALIZATION AND DRIVE LEVEL 


are small in relation to be- 
havioral oscillation and measurement 
error. However, when systematic differ- 
ences in reaction potential are introduced 
between groups by the operation of 
experimental variables, high correlations 
may be obtained. The between-condi- 
ons correlation of .99 leaves little room 
for doubt that the measures are measur- 
ing the same thing, and that they are 
linearly related within the range in- 
cluded. A plot of the relation between 
first-trial speed and percent response on 
remaining trials is presented in Fig. 3. 
e main import of the figure is not 
merely the close relationship, which is 
Clearly indicated by the correlation 
Coefficient, but is the illustration of the 
Positions of the various group means. 
t is of considerable interest to note that 
the relative position of a given point is 
*termined jointly by the amount of 
8eneralization decrement and by the 
oe of deprivation. The fact that 
il prices fall along the same regression 
m Strongly suggests that both experi- 
“ntal variables are influencing the same 
hing, 
en din it is not to be expected that 
obtai a simple picture will always be 
then oo For example, there is evidence 
» In the runway, amount of training 
Po , resistance to extinction are not 
Sttively related (Ison, 1962). On the 
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other hand, the present data do not 
stand entirely alone. In an experiment 
with constant drive, but with general- 
ization in both directions, Grice (1956) 
has reported a between-conditions corre- 
lation of .89 over a somewhat more 
attenuated range of the same two vari- 
ables. Perin (1942) reported extinction 
data and initial latency in an experiment 
involving concurrent manipulation of 
both deprivation and amount of training. 
A plot of extinction trials and median 
initial speed (reciprocal latency), indi- 
cates that these data also fall on a single 
regression line, and the between-condi- 
tions correlation is .94. Taken together, 
these three sets of data suggest that, at 
least within a limited domain, depri- 
vation, generalization decrement, and 
amount of training influence a common 
underlying variable, which may be meas- 
ured by either response speed or resist- 
ance to extinction. Specifying the 
boundaries of this domain would appear 
to be a problem of considerable theo- 
retical importance. 
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STIMULUS GENERALIZATION AS A FUNCTION 
OF DRIVE SHIFT ! 


ROBERT B. ZAJONC axb DAVID V. CROSS 


University of Michigan 


6 pigeons reduced to 70% and 6 reduced to 90% of their body weight 
were trained to discriminate a 10-mm. circular spot from 4- and 16-mm. 
Following training, Ss in the 1st group were raised to 90% and 
Ss in the 2nd dropped to 70% of their body weight. All birds were 
tested for generalization to spots 6, 8, 10, 12, and 14 mm. in diameter. 


Following brief discrimination retraining the birds were returned to their 
original body weights and again tested for generalization. The general- 


À series of recent studies have 

Shown that a change in drive, follow- 

mg training, produces a Toterall iis 

Placement of the generalization gra- 

dient (Zajone & Dorfman, 1964). 

latg (1962) found such a displace- 

ment in generalization along the 

| loudness continuum as a function of 

di anges in conditioned fear. Similar 

wablacements were found when drive 

sti replaced by irrelevant extraneous 

mulation., For instance, Dorfman 

1961) found a lateral displacement in 

Seneralization to tactual stimuli as a 

p ction of changes in the amplitude 

Simp tory stimulation occurring 
uli taneously with the eliciting stim- 
' and Karp (1961) corroborated 
sti an'e findings using the same 
tra rus continuum and the same ex- 
“Neous stimulation but a somewhat 
e ee training procedure. These 
U'ts together with some findings m 
ber. of Sensory interaction bs 
ʻO indi Y Zajonc and Dorfman ( ) 
ions icate that drive not only fne 
la 35 an energizer of responses, DY 
t it has perceptual effects as well. 

Y assuming that the stimulus com- 


Ti 


y his st ogram 
= udy was part of a research progra 
Regearted by the Ait Force Office of Scientific 
to qh, Grant AF 49(638) 367. We wish 
ths tan David Carter for his assistance 17 
Abulation and analysis of results. 


ization gradients were found to shift toward larger stimuli when drive 
level was raised and toward smaller stimuli when it was lowered. 


ponent of drive interacts with the 
perception of the eliciting stimulus, 
either increasing or decreasing its 
apparent intensity, these authors were 
able to account for the lateral dis- 
placements of generalization gradients 
following a drive shift. 

'The experiments referred to above 
used human Ss to examine the effects 
of drive shift on stimulus generaliza- 
tion. In the present study pigeons 
were used to investigate size general- 
ization as a function of changes in the 
schedule of food deprivation. 


METHOD 


Twelve male, White Carneaux pigeons 
were used as Ss. All Ss were placed on 
restricted feeding until their body weights 
reached 80% of their free-feeding level. 

Apparatus.—A conventional experimental 
chamber for pigeons (Grayson-Stadler, 
E1100PB) was used. It was modified by 
mounting a small stripfilm projector on its 
side so that the stimulus (the only source of 
light in the chamber) could be focused on the 
pecking key. Seven stimuli were employed 
during the course of the experiment. These 
consisted of white circular spots on black 
backgrounds photographed and copied on 
35-mm. high contrast copy film. The stimuli, 
as projected onto the back of the pecking key, 
ranged in size (in 2-mm. steps) from 4 to 16 
mm. in diameter. 

Procedure.—Following the achievement of 
stable weight, all Ss were magazine trained 
and conditioned to keypeck with the key con- 
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tinuously illuminated by a 10-mm. diameter 
spot of light. After one session during which 
50 continuous reinforcements were provided, 
the pigeons were divided equally into two 
groups: Group I Ss were raised to 9095 of 
their free-feeding body weight and Group II 
Ss were dropped to 70% of their free-feeding 
level. Controlled feeding, during all of the 
training sessions that followed, maintained 
the body weight of each S within 5 gm. of 
that established. All Ss received 60 min. of 
fixed interval 15-sec. training to the 10-mm. 
spot, consisting of 100 15-sec. work periods 
separated by 15-sec. blackouts, Reinforce- 
ment consisted of access to the food magazine 
for 5 sec. All Ss were then given nine daily 
sessions of discrimination training during 
which each of three stimuli, (the initial 10- 
mm. spot and the 4- and 16-mm. Spots) were 
presented 16 times in random sequence for 
15-sec. periods separated by 15-sec. blackouts, 
Responding to the 10-mm. Spot was reinforced 
at the end of each Presentation period with 15 


guished. The Ss were run in pairs, one S from 
each group, matched for initial response rate. 
Each pair was run at the same time every 
alternate day, Following the last day of 
discrimination training each S was shifted in 
body weight; Group I Ss were shifted to 70975 
of their free-feeding level and Group II Ss 
were shifted to 90% of their free-feeding level. 
Three to 4 wk. were Permitted to elapse be- 
tween training and testing for each pair until 
the new levels were stabilized, Generalization 
testing consisted of repeated presentation of 
five stimuli (6-, 8-, 10-, 12-, and 14-mm. spots) 
under extinction conditions. The 4- and 16- 
mm. spots were omitted. Each stimulus was 
presented 10 times for 15 sec, according to a 
predetermined schedule of random permuta- 
tions of the five stimuli, Fifteen-second 
blackouts separated each Presentation. Each 
S of a given pair received the same random 
sequence of trials but different random se- 
quences were used for different Pairs. Three 
daily test sessions were given. At the end of 
each session the 10-mm. spot was Presented 
briefly (5 sec.) and responding was reinforced, 
Following these testing sessions, body weights 
were again shifted to the level Previously 
established for discrimination training and one 
additional session of generalization testing 
was given. Thus, Group I received training 
under low-drive conditions and generalization 
testing under high-drive conditions, followed 
by further testing under low drive. Group II 
received identical treatment under high drive, 
low drive, and again high drive. 
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RESULTS 


Figure 1 shows the effects of stim- 
ulus size and drive level on response 
frequency during each of 3 consecutive 
days of generalization testing. The 
Group I curves represent total re- 
sponding to each stimulus value for 
all of six birds trained under low-drive 
and tested under high-drive condi- 
tions. Although all birds in Group II 
(trained under high and tested under 
low drive) were given 3 days of test- 
ing, some failed to perform after the 
first or second day. Thus, the Group 
II curves represent response totals for 
six, three, and two Ss for Days 1, 2, 
and 3, respectively. It appears that 
the shape of the generalization curves 
for each group remained stable over 
test sessions. It can also be seen 
that birds for whom drive was in- 
creased during testing responded to 
stimuli 12 and 14 mm. in diameter at 
a higher rate than to stimuli 6 and 8 
mm. in diameter. For birds trained 
under high and tested under low drive, 
the converse is true, An analysis of 
variance of these response measures 
showed that both main effects, stim- 
ulus size, F (4, 50) = 30.342, p < .001, 
and drive level, F (1, 50) = 58.048, 
P < .001, were significant as was the 
effect due to their interaction, F (4, 50) 
= 4511, p < .01. These results are 
reported primarily to establish the 
fact that drive had a significant effect 
on overall rate of responding. Our 
chief interest, however, is in the 
lateral displacement of the general- 
ization functions. To control for the 
effects of drive on overall response 
rate, the above measures were normal- 
ized relative to the total responding 
for each S. The generalization func- 
tions relating proportion of total re- 
Sponses to stimulus size for the two 
8roups are shown in Fig. 2. Analysis 
based on these proportions provides 2 
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better test of our hypothesis which 
Was concerned with drive-produced 
'rectional asymmetries in the gen- 
*talization functions. An orthogonal 
Polynomial trend analysis was per- 
?Tmed. The results are summarized 
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in Table 1. A significant interaction 
effect was obtained (F = 6.302, 
p < .001) owing to differences in the 
asymmetric (linear and cubic) com- 
ponents of the two generalization 
functions. These differences are ex- 
hibited in Fig. 2 in the relative dis- 
placement to the right in the Group I 
gradients and to the Jeff in the Group 
II gradients. 

The results for individual Ss are 
shown in Fig. 3. The generalization 
curve obtained from each S is com- 
pared with that of the S with which it 
was paired in the experimental treat- 
ment. The response measures were 
normalized relative to the total num- 
ber of responses emitted by each S. 
The interaction effect is exhibited in 
the generalization behavior of all pairs 
of Ss: There is a modal shift for one 
bird trained under low and tested 
under high drive and in the Opposite 
direction for two birds which were 
trained under high and tested under 
low drive. The gradients of birds 
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tested under low drive are somewhat 
steeper than those for birds tested 
under high drive, a finding consistent 
with the results obtained by Thomas 
and King (1959). 

When Ss were shifted back to the 
body weight at which they were 
initially trained and then retested for 
generalization, a tendency to reverse 
the initial pattern of behavior was 
observed. Three Ss failed to perform 
to a criterion of the total emission of 
at least 500 responses during this 
session, and were thus not included in 
the calculation of results. Since 
Group I was originally tested under 
70% of body weight, the retest for 
this group involved a decrease in drive 
level. Group II, on the other hand, 
was originally tested under 90%, and 
the retest for these animals repre- 
sented an increase in drive. The dis- 
placement of the gradients as a result 
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of first and second drive shift is best 
seen when we compare normalized 
response rates to stimuli 6 and 8 mm. 
in diameter with those to stimuli 12 
and 14 mm. in diameter. The per- 
centage of total responses (excluding 
those emitted to the middle stimulus, 
10 mm.) is shown in Table 2. When- 
ever drive is increased the response 
rate to the two smallest stimuli is less 
than that to the two largest stimuli. 
This is true for the first test in Group 
I and retest in Group Il. A decrease 
in drive, on the other hand, leads to 
an increased response rate to the two 
smallest and decreased response rate 
to the two largest stimuli (Group 1). 
In Group I, however, while the re- 
Sponse rate to the smallest stimuli is 
higher and to the largest is lower on 
retest than it was on the first test, the 
second shift is quite weak. 
Considering that 2 mo. elapsed 


TABLE 1 
ANALYSIS OF VARIANCE FOR DATA IN Fic. 2 
Source df MS Tern d 
(Row) 
Stimulus effect (4) | (201,615.75) E 82,29*** 
1 po . 1 ...1,197.01 E.1 46 
- Quadratic 1 781,053.65 E.2 41.77 
3. Cubic. 1 6,931.20 E.3 2.55 
4. Quartic 1 17,281.07 EA 6.58* 
Between group means (Drive Effect) (1) 0 
Between group trends (Drive Effect X Stim- 
ulus Effect) (D (14,779.00) | E | scosr* 
1. Linear | 1 18,775.00 El 7.27 
2. Quadratic 1 3,146.05 E.2 1.68 " 
3. Cubic. 1 34,408.50 E.3 12.65* 
4. Quartic 1 2,786.70 E4 1.06 
Between individual means (Response Rate) (10) 0 
Between individual trends (40) (2,449.97) 
E F . 10 2,583.47 
2. Quadratic 10 1,869.76 
Ea Cubic 1 2,720.21 
4. Guartic 2,626.43 
* p c .05. 


**5 < OL. 
week p < 001. 
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Since the last day of training and that 
these birds were tested under low- 
rive conditions subsequent to 3 days 
9! extinction under high drive, it is 
Understandable that their behavior 
Would exhibit noticeable variability. 
n fact, the retest generalization 
“nections in Group I were displaced 
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to the left for two Ss, and to the right 
for the other two. 


DISCUSSION 


In the experiments quoted above by 
Dorfman (1961), Karp (1961), and Platz 
(1962) an increase in drive resulted in a 
shift of the generalization gradient to- 
ward stimuli greater in intensity than the 
conditioned stimulus, while a decrease in 
drive resulted in a lateral displacement 
toward smaller stimulus intensities. It 
is clear from the results of the present 
experiment that a lateral displacement of 
the generalization was also obtained as a 
function change in the level of hunger. 
However, in contrast with the previous 
findings, an increase in drive shifted the 
gradients in the direction of larger circles 
and a decrease in drive in the direction 
of smaller circles. A lateral shift in 
generalization gradient of a similar 
character was also found by Matlin 
(1960). In Matlin’s experiment Ss were 
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trained to make one response to a tone 
of medium intensity (77 db.) and another 
to one tone of lower (70 db.) and one tone 
of higher intensity (85 db.). Dynamom- 
eter pressure served to manipulate drive 
level. During training one group of Ss 
pressed a dynamometer to $ of their 
maximum and another to zy of their 
maximum. During testing observations 
were made of generalized responses to 
the three training and four other stimuli 
(73, 75, 79, and 81 db.). The generaliza- 
tion trials involved lowering drive for the 
former and increasing drive for the latter 
group of Ss. A significant lateral dis- 
placement of generalization gradients 
toward stimuli of greater intensity was 
obtained for the group trained under low 
and tested under high drive. A tendency 
in the opposite direction was observed 
for the group trained under high and 
tested under low drive. On the other 
hand, Carter and Zajonc (1962) in a 
similar attempt trained two groups of 
pigeons to keypeck for food reinforce- 
ment. One group was given such train- 
ing during the presence of a constant 
electrical stimulation high in intensity 
(.8 ma.) and another low in intensity 
C4 ma.) Responses were reinforced 
only in the presence of a 1,000-cps, 43.0- 
db. tone. Following training, shock in- 
tensity was increased for the low-shock 
group and decreased for the high-shock 
group and all Ss were tested for general- 
ization to seven auditory stimuli of 1,000 
cps varying in sound pressure from 27.0 
to 69.5 db. As in the other studies no 
reinforcement was administered during 
generalization testing. These investi- 
gators found that the generalization 
along the loudness dimension is displaced 
toward stimuli less loud than the CS 
when drive is increased following train- 
ing, and toward stimuli louder than the 
CS when drive level is decreased. These 
shifts were obtained, however, only on 
the initial trials following drive shift. 


In the light of these findings it is 
apparent that drive does in fact interact 
with stimulus generalization, and the 
Possibility of perceptual effects deriving 
from the stimulus component of drive is 
strengthened by the present results. 
However, no simple rule seems to de- 
Scribe the nature of this interaction, nor 
to predict the direction of the lateral 
shift. 
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SUBJECTIVE INTENSITY OF MINERAL TASTE IN WATER: 
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An experiment determined the ability of a power function to summarize 


ratings given solutions of NaCl, MgCls, and Na:SO,. 
solutions of a salt using a magnitude-estimation procedure, 


16 Ss rated 9 
A separate 


experiment was devoted to each of the 3 salts studied. Results in- 
dicated that a power function, v = k^, could account for a substantial 


portion of the variance in the ratings given. 


Further analyses of in- 


dividual ratings showed that power functions varied significantly be- 


tween Ss. 


It was proposed that the individual variation could be ac- 


commodated by an analysis of variance model. 


. The method of magnitude estima- 
tion, described by Stevens (1956, 
1958), has been used to obtain ratings 
9f the subjective intensity of stimuli 
alling along a prothetic continuum. 
"imuli that form a prothetic con- 
tinuum differ in physical intensity, 
ut they all belong to a single qualita- 
tive dimension. A substantial number 
Of different prothetic continua have 
cen investigated using magnitude- 
estimation procedures. A general 
nding has been that the relationship 
Stween the physical intensities of the 
Stimuli and their reported subjective 
Mtensities can be described by the 
equation y = ko" (Stevens, 1960) or 
Y the equation y = k(e — e)" 
(Stevens, 1962). 
Only a small amount of research 
aS been performed that employs 
Magnitude estimation to investigate 
Subjective intensity for the modality 
| taste. Stevens (1960) reports, 
Without detailing the exact methods 
employed, that McLean found ex- 
Ponents (x in the above equations) of 
8 for saccharine, 1.3 for sucrose, and 
M for sodium chloride. Jones and 
arcus (1961) determined exponents 


f H " 
hi ratings of sodium chloride 
UThe rese: 3 e 
arch reported in this paper was 
Tee ported in part by Fellowship Award WSP- 
r 3520) from the Public Health Service. 


solutions. The median exponent of 
the 98 computed was 1.05. These 
two researches appear to report the 
major data for tasted substances 
generated by magnitude estimation 
and described by a power function. 

The present study was conducted 
in order to obtain information con- 
cerning Ss’ ratings of the subjective 
intensity of the taste of solutions 
containing different amounts of a 
mineral solute. Three separate ex- 
periments were performed using the 
method of magnitude estimation. 
Sodium chloride was used as the solute 
in the first experiment. Magnesium 
chloride and sodium sulphate were 
used as solutes in the second and third 
experiments, respectively. The speci- 
fic procedures employed in the experi- 
ments were designed to provide a test 
of the adequacy of the power law 
equation, to analyze individual differ- 
ences in ratings, and to compare the 
power functions representing the three 
minerals rated. 


METHOD 
Solutions 


Minerals commonly found dissolved in 
water include calcium, sodium, and mag- 
nesium in combination with chloride, sul- 
phate, or carbonate (Babbitt, Doland, & 
Cleasby, 1962). After considerable pilot work 
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was completed, it was decided to use NaCl, 
MgCl, and NaSO, as solutes for study. 
None of these compounds has any odor, and 
all have a high level of solubility. 

In each experiment a series of nine solutions 
was employed. Using distilled water as the 
solvent and a reagent grade chemical as 
solute, solutions of 1,000; 2,000; 4,000; 7,000; 
10,000; 20,000; 40,000; 70,000; and 100,000 
mg. per liter (ppm) of NaCl were prepared 
for use in the first experiment. Series of 
solutions containing the same solute levels 
were prepared, when needed, for use in the 
MgCl: and NaeSO, experiments. Both NaCl 
and NasSO, were of anhydrous form, and 
their exact weights were used in preparing the 
nine solutions. The MgCl: was hydrated, 
having the formula of MgCl;:6H;0. Proper 
allowance was made for the hydration factor 
in preparing the MgCl, solutions. The error, 
in ppm of dissolved solute, for each solution 
in each series was less than 1% at the time the 
solutions were delivered from the chemical 
laboratories. 


Subjects 


Sixteen Ss served in all three experiments. 
Of these, 12 were males and 4 were females. 
The ages for the group ranged from 16 to 51 
yr. All Ss served voluntarily without pay. 

The sample of Ss employed was hetero- 
geneous in many respects, Age and sex 
variations have been noted. Educational 
level varied from students in high school to 
recipients of doctoral degrees. Occupation 
also varied widely. Stenographers, chemists, 
high-school students, professors, engineers, 
and housewives were included in the sample. 
Perhaps the best general description that 
could be given of the entire group of Ss would 
be to state that all resided in the San Francisco 
Bay Area and all had some connection with 
the University of California or the California 
State Department of Public Health. 


Procedure 


Before S arrived for an experimental ses- 
sion, E prepared two series of solutions. Each 
series contained the nine solute levels de- 

cribed above. Six standard solutions of 
1 000 ppm were also prepared. All solutions 
Per resented in 50-ml. beakers, filled to the 
ped level. The solutions to be rated bore 
ae erent scat save a single letter to allow 
a keep proper records. The standard 
eth Ta all marked with a capital “S,” 
m were clearly and easily identifiable. 
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Each experimental session was divided into 
two parts, with a 5-min. rest between parts. 
During each part, S rated nine solutions 
against the standard solution. A standard 
was presented first, followed by three un- 
known solutions. This block of solutions was 
presented three times. Thus, Part 1 of an 
experimental session consisted of three blocks 
of four solutions each, where each block con- 
tained a standard and three unknown solu- 
tions. After a 5-min. rest interval, the same 
procedure was repeated. The order of the 
standard solutions was known to S and fixed. 
The order of the solutions to be rated was un- 
known to S and determined by chance. 

The experimental sessions were conducted 
in a small air-conditioned room, the tem- 
Perature of which was constantly kept very 
near to 70°F. A laboratory table with a sink 
was used for the sample presentation and 
expectoration. A large electric clock was 
visible to S for use in timing rest periods. 

Instructions given to S before the start of 
each experimental session served to explain 
the rather complicated duties he was to 
perform. The entire experimental procedure, 
including the full set of instructions, was re- 
peated three times at intervals of about 30 
days for each S, 


Instructions 


The instructions given to each S at the 
beginning of each experimental session are 
presented below. 


[General Instructions] This is an e 
periment to scale the subjective intensity © 
salt solutions. You will be asked to give 
numbers that reflect how strong the solu- 
tion tastes to you in accordance with the 
instructions. “The standard solution 1S 
called 100. That is, it is given the ed 
100 to provide an "anchor" for a sca m 
Your job is to rate the remaining 9 sa 
solutions in terms of the standard and its 
number. If the 1st solution tastes twice if 
strong to you as the standard, call it D 
If it tastes half as strong, call it 50. Ifi 
tastes 16 times as strong, as the standarto 
call it 1600. If it tastes } as strong, call i 
25. TRY TO ASSIGN THE NUMBERS SO THAT 
THEY INDICATE HOW STRONG THE SOLUTION 
TASTES IN RELATION TO THE STANDARD- 
Do not feel overly constrained by numbers- 
Let your taste be the major factor 1n 
determining the number thai reflects how 
the various solutions compare with the 
standard. 

[Phase I] The exact procedure you are 
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to follow during Phase I of the experiment 
is outlined below. 

1. Take all of the standard into your 
mouth and hold it for about 5 seconds and 
then spit it into the sink. 

2. Fill the larger beaker with tap water. 
Rinse your mouth thoroughly with the tap 
Water. Repeat the rinsing until the salty 
after-taste disappea Do not rinse more 
than 5 times. Wait approximately 1 
minute. 

3. Take all of the first solution into your 
mouth and hold it for about 5 seconds and 
then spit it into the sink. 

4. Report your rating of the intensity of 
the solution to the experimenter. 

_5. Rinse your mouth thoroughly as in- 
dicated above. Wait approximately 1 
minute. 

6. Repeat steps 3, 4, and 5 until all 9 
solutions have been rated. Remember to 
rinse and wait 1 minute between tasting 
each solution. 

[Phase II] During Phase II of the 
€xperiment you are to rest for 5 minutes. 

it quietly in the chair during this time. 

o not leave the experimental room. 

[Phase III] During Phase III you are 
fo Tepeat exactly the procedures followed 

uring Phase I. These procedures can be 
Summarized as: 1) Taste the standard and 
Tinse; wait 1 minute. 2) Taste the Ist 
Solution and report the numerical rating. 

Rinse and then wait 1 minute. 4) Re- 
Peat these steps until all 9 solutions have 

een rated. 

was pj addition to these itistrnictions, boop s 
outlin d. à printed summary of the six step 
sy ed under Phase I above. The summary 
as placed on the laboratory table in front of 
to remind him of the details of the rating 
Mateuta, Multiples and fractions of 100 
i re included on the summary sheet. nae 
and d. in bold print that these maul Ds 
and Tactions were to serve as examples only, 
5 that they were not to be construed as 
Umbers that had to be used in the ratings. 


RESULTS 


The logarithm of a response which 
llows the power law can be expressed 
aS a linear function of the logarithm 
€ stimulus plus a random error. 

"d Hie power jd does not hold, the 
can lonship between the logarithms 
m Still be expressed as the power 
W, plus deviations from the power 


fo 
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law, plus a random error. Thus, the 
log of the response of any one indi- 
vidual for a particular stimulus can be 
written as 


log Vi; = log k + n log e; + 9; + Ei 


where y; is the response magnitude at 
the jth replication for the stimulus of 
the 7th intensity, 6; is the deviation 
from the power law at the 7th stimulus 
intensity, and E;; is random error. 
In the present study there were 
nine levels of stimulus intensity 
(i = 1::-9), measured by the amount 
of solute in solution, and 2 (j — 1,2 

replications at each intensity level. 
The power law applies if and only if 
all à; = 0, that is, if the regression of 
the log response on the log stimulus 
intensity is linear. It was assumed 
that E; was normally distributed, 
homoscedastic, and mutually inde- 
pendent. The third condition is 
assured by the experimental design, 
and failure of the first two does not 
substantially affect analyses of vari- 
ance (Scheffé, 1959). 

Least-squares estimates of log k 
and z, residual sums of squares, and 
sums of squares around regression 
were calculated for each of the 16 Ss 
for NaCl solutions. The same calcula- 
tions were also made for MgCl; and 
NasSO, solutions. Thus, a total of 48 
separate analyses of regression was 


TABLE 1 
Power Law Constants 
Constant Solute Group Value Range» 
NaCl .021 .000—- .289 
k MgCl: .021 .000-2.228 
NaSO; .027 .000-1.926 
All solutes 023 .000-2.228 
NaCl -920 -634-1.564 
» MgCl; .937 -498-1.598 
Na:SO, .892 -481-1.421 
All solutes .916 -481-1.598 


a 000" means zero to the nearest thousandth. 
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curve are simply estimates of log k (and 
thus of k) and n, the basic population 
constants. 

Jones and Marcus (1961) reported that 
an individual-modality interaction for 
exponents (analogous to nis in the above 
equation) could be accounted for by a 
product involving individual and modal- 
ity main effects. The occurrence of a 
significant individual-solute interaction 
for exponents obtained in the present 
study cannot be similarly explained as a 
product involving individual and solute 
main effects since the solute main effects 
were found to be nonsignificant. The 
application of Tukey's test for non- 
additivity confirms this reasoning by 
finding no significant multiplicative com- 
ponent in the individual-solute inter- 
action. Apparently the interaction arises 
from idiosyncratic reactions of Ss to 
particular solutes. The reactions in- 
crease the response for some Ss and 
decrease it for others, so that the overall 
average remains essentially constant 
from one solute to the next. 

It would be instructive to investigate 
the extent to which interaction of the 
type found by this study is due to 
casual divergences in the individual en- 
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vironments of Ss, as opposed to more 
fundamental differences. In the former 
case, a repetition of the experiment with 
a homogencous population, and with a 
shorter time between tests of the different 
solutions, might be expected to reduce 
the interaction sharply. 
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In 8 experiments involving a total of 230 rats the overlearning-reversal 


effect failed to occur in a brightnes: 


manipulation of a number of possib 


pattern of results was a sma! 
trials to reversal criterion, favoring 
trained Ss. 


The literature on the overlearning- 
reversal effect—the facilitation of re- 
versal learning by extensive postcri- 
terial training—is perplexing. After 
Reid’s (1953) initial demonstration 
that rats allowed extensive overtrain- 
ing on a brightness discrimination 
Would reverse faster than Ss reversed 
Immediately after reaching criterion, 
there subsequently appeared a number 
of confirming reports. For a time the 
Overlearning-reversal effect (ORE) 
Seemed so well-established that it 
found employment as an analytic 
tool in comparative psychology (e.g, 

Yookshire, Warren, & Ball, 1961; 
Warren, 1960; Warren, Brookshire, 
Ball, & Reynolds, 1960). At a some- 
what more general level, the facilita- 
tive effects of overtraining, of which 
the ORE is but a special case, has had 
Mportant theoretical consequences 
*-8., Mandler, 1962). 

ne of the first indications that the 

E might not be the general phe- 
nomenon it appeared to be came ina 
report of four separate failures to find 
an ORE in a position-reversal task 
(D'Amato & Jagoda, 1962). In very 
Short order these failures were verified 
Y others (Clayton, 1963; Hill & 

1 
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s-discrimination task despite the 
ly relevant variables. The usual 


l, nonsignificant, difference in numbers of 


the controls as often as the over- 


Spear, 1963; Hill, Spear, & Clayton, 
1962; Komaki, 1962), bringing to at 
least 11 the number of experiments 
in which the ORE could not be ob- 
tained in a position-discrimination 
task. On the positive side there is the 
initial success by Pubols (1956) and a 
more recent report by Capaldi (1963), 
though there may be others that have 
escaped us (cf. also Brookshire et al., 
1961). If the ORE is a genuine 
phenomenon in position-discrimina- 
tion reversal, the set of conditions 
under which it can be expected to 
occur must be very restricted indeed. 

Still, the major evidence behind the 
ORE comes from visual-discrimina- 
tion tasks, usually black-white dis- 
criminations, and as far as we know 
there is only one published report in 
which the ORE failed to appear in the 
visual-discrimination setting (Erle- 
bacher, 1963). But here too, after an 
initial marginal success (D'Amato & 
Jagoda, 1961), our efforts to lay hold 
of the ORE within a brightness-dis- 
crimination task have repeatedly met 
with failure. Because the phenom- 
enon seemed so well-established we 
were reluctant to publish our results 
until we had unearthed the variables 
responsible for our unsuccessful at- 
tempts. Having now exhausted all 
promising hypotheses as to the causes 
of our fruitless search, a record of our 
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studies may be of value to those who 
wish to concern themselves with this 
important, interesting, but apparently 
elusive, phenomenon. 


EXPERIMENT I 


It should be pointed out that the 
ORE, when it does occur, is an effect 
of considerable magnitude, statis- 
tically demonstrable with small num- 
bers of Ss. Consequently, we adopted 
the strategy of using relatively small 
groups of Ss in our sequence of experi- 
ments, intending to follow up promis- 
ing results with larger scale studies. 

The first experiment was essentially 
a repetition of our initial brightness- 
discrimination study (D'Amato & 
Jagoda, 1961), aimed at determining 
whether the relatively small difference 
obtained in that study (between 
Group C and Group C-200) was 
simply a chance result. 


Method 


Subjects —The 


Ss in Exp. I-VI, naive 
albino rats (45% 


males), were taken from 
litters bred in our laboratory under stand- 
ardized conditions, They ranged from 75 to 
140 days of age when beginning Pretraining. 

A pparatus.—Four single-unit automatic Y 
mazes, described in detail in à previous report 

(D'Amato & Jagoda, 1960), were used in this 
and subsequent experiments, 

Pretraining.—Four days prior to the be- 
ginning of discrimination training Ss were 
placed on a 23-hr. water-deprivation Schedule, 
food being available ad lib. The only pre- 
training given in the Y mazes consisted in 
permitting S to drink five dippersful of water 
on the day prior to the beginning of training; 
three dippersful were allowed on the sub- 
sequent day. This pretraining procedure was 
followed in all experiments, 

Discrimination training. —All Ss were run 
20 trials per day on a brightness-discrimina- 
tion problem until the criterion of 18 correct 

hoices out of 20 successive free trials, with 
om last 10 correct, was met. Throughout 
dir dn 20% of each hm So x trials tice 
forced by locking closed the door to one of the 


ms. Two of the forced trials were to S+ 
arms. 
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and two were to S—. For 12 Ss the positive 
stimulus was the bright arm (10.0 footcandles 
[ftc.]), and for 8 Ss it was the dim (0.1 ftc.). 
Since the results were comparable for the two 
conditions, this variable was not maintained 
as a factor in subsequent statistical analyses. 

The “reward interval" during which S 
had access to a full water dipper was up. 
proximately 2 sec.; the "stimulus interval, 
the period during which the discriminanda 
remained on in the maze after the dipper was 
retracted, was 8 sec.; the intertrial interval 
(ITI), spent in near total darkness, was about 
60 sec.; finally, “the starting interval,” the 
Period signaling the beginning of the next 
trial, lasted some 5 sec. (For further ex- 
Planation see D’Amato & Jagoda, 1960.) 

Upon reaching acquisition criterion Ss 
were assigned to a control group (Group C) 
and to an overlearning group (Group O) in 
Such a way as to balance the two groups for 
mean acquisition scores, e 

Overtraining and reversal.—Overtraining 
consisted of 240 free trials, distributed 20 D 
day. Aside from the absence of forced trials 
Overtraining was identical to acquisition 
training. During reversal, 20 free trials were 
given daily until .Ss met the same criterion as 
employed in acquisition; in addition, all Ss 
Were run a minimum of 160 trials. 


Results 


Acquisition and overtraining.—Since 
the control and Overtrained groups 
were always closely equated on ac- 
quisition scores, acquisition data will 
not be presented for each individual 
experiment. The mean numbers © 
trials required to reach acquisition 
criterion ranged between 63.3 am 
O75, À 

Errors committed during overtrain- 
ing varied between 1.8% (Exp. VII) 
and 5.7% (Exp. IV). 

Reversal.—There was no sign of an 
ORE in the trials-to-criterion der 
Group C (N = 11) required 145. 
trials on the average to reverse, while 
for Group O (N = 9) the correspond- 
ing figure was 153.2. A trend analysis 
showed the linear components of the 
reversal curves not to differ signifi- 
cantly, F (1, 18) = .84. 
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EXPERIMENT II 


We now embarked upon a series of 
experiments in an attempt to uncover 
the factors that prevented the occur- 
Tence of an ORE in our experimental 
Situation, the first experiment being 
Motivated by the following hypoth- 
esis. Let us suppose that small stim- 
ulus changes are usually associated 
With the transition from acquisition 
to reversal learning. For example, if 

S are run in squads, due to the cir- 
cumstance that Ss slow up when re- 
versed, the ITI will probably increase 
during the early portions of reversal 
earning. This change, as well as 
Others that might occur on the transi- 
tion from acquisition to reversal, ought 
to be more readily discriminated by 
the Overtrained Ss due to their ex- 
tended experience with the stimulus 
conditions of acquisition. Conse- 
quently, such changes should be more 
disruptive to overtrained Ss and result 
it relatively faster extinction of the 
9rmerly correct response. If there 
Were anything to this line of reasoning 
it Could explain our prior failures to 
obtain the ORE, since the highly con- 
rolled conditions of our experimental 
“uation permit of no stimulus 
shanges in reversal other than the 

ifting of reward from the former S+ 
© the former S—. In any event, the 
2 pbropriateness of this hypothesis 

S explored in the present experi- 
E ent by imposing a stimulus change— 
eduction of the stimulus interval— 

Wing reversa] learning. 


Methog 


was this and subsequent experiments S+ 
free. 26 bright arm and all trials were run 
stim, € reward interval was 2 sec., and the 

sec, po terval, 8 sec. There followed a 
€Xtingy: during which the stimuli were 
exposa Shed; consequently, S's postchoice 
10 mee to the discriminanda amounted to 
en trials were given on Day 1 of 
and 20 per day thereafter. Over- 


ulus 


acquisition 
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training consisted of 240 trials beyond the 
acquisition criterion. 

Reversal.—The postchoice exposure to the 
discriminanda was reduced during reversal 
from 10 to 4 sec. by reducing the stimulus 
interval from 8 to 2 sec. To maintain the 
time between choice and the start of a sub- 
sequent trial constant, 6 sec. was added to the 
ITI, increasing the latter to 60 sec. As usual, 
the ITI was spent in darkness. 


Results 


The mean numbers of trials to 
reversal criterion were 151.4 for Group 
C (N = 12) and 136.9 (N = 12) for 
Group O. The difference, though in 
the “right” direction, is small and does 
not approach significance, / < 1. 


EXPERIMENT III 


In this experiment we pursued the 
differential generalization decrement 
notion further by introducing a more 
imposing change between acquisition 
and reversal, at the same time employ- 
ing larger groups of Ss. 


Method 


The only difference between the procedures 
of the present experiment and those of Exp. II 
was that, during acquisition and overtraining, 
the discriminanda remained on for the dura- 
tion of the ITI. Consequently, during ac- 
quisition and overtraining S was exposed to 
the discriminandum in the arm of his choice 
throughout the trial sequence (a total of 64 
sec.), until the beginning of the starting 
interval of the next trial. During reversal 
the postchoice duration of the discriminanda 
was reduced to 4 sec., and S spent the next 
60 sec. in darkness. All Ss were run a 
minimum of 160 reversal trials. 


Results 


The mean numbers of trials to re- 
versal criterion were 134.8 for Group 
C (N = 20) and 130.1 for Group O 
(N = 18), respectively. The absence 
of an ORE in the trials-to-criterion 
data is obvious. The difference be- 
tween the linear components of the 


380 


to the ORE than are our own animals. 
The difference between the linear 
components of the reversal curves was 
small, F (1,17) = .05. 


EXPERIMENT VIII 


The last experiment to be reported 
had two purposes. Earlier work had 
shown that acquisition of a brightness 
discrimination is consistently impeded 
in our Y mazes when the discrimi- 
nanda are permitted to remain during 
the ITI (D'Amato, 1960). One aim 
of the present study was to ascertain 
whether this result extends to pig- 
mented Ss. Since albinos alone had 
been used in Exp. I-VII, we continued 
on through Overtraining and reversal 
on the chance that the pigmented Ss 
might show some signs of an ORE, 


Method 


Subjects—The Ss were 28 naive male ratss 
80-100 days of age at the start of the study. 
Drawn from three hybrid litters bred in our 
laboratory, hooded black and brown .Ss 
accounted for 26 of the Ss, and the remaining 
2 were albinos. 

Acquisition.—The Ss were trained 20 trials 
per day on the usual brightness-discrimination 
problem. For half of the Ss, randomly chosen, 
the discriminanda remained on during the ITI 
(Group A); for the other half (Group B) the 
discriminanda were extinguished. The two 
groups were otherwise treated alike. The 
reward, stimulus, intertrial, and starting 
intervals were 2, 8, 60, and 5 sec, Upon 
reaching criterion each S was assigned to 
either a control or an overlearning group, 
making four groups in all, Groups AC, AO, 
BC, and BO (N = 7). 

Overtraining and reversal.—Overtraining 
consisted of the usual 240 trials. All Ss were 
runa minimum of 140 reversal trials, Perhaps 
it should be explicitly stated that Ss of Cond. 
A remained in the presence of the discrimi- 
nanda during the ITI throughout overtraining 
and reversal. 


Results . ; 
uisition.—1n verification of ear- 

T epar the 14 Ss trained with the 

decriminanda present during the ITI 
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required significantly more trials to 
learn the brightness discrimination 
than those in the B condition, the ap- 
propriate means being 82.6 and 65.1, 
respectively, ¢ (26) = 3.00, p < .01. 

Reversal.—Control Groups AC and 
BC averaged 170.4 and 134.1 trials, 
respectively, to meet the reversal cri- 
terion. For the corresponding over- 
trained groups these values were 154.7 
and 132.4. Obviously the differences 
between the control and overtrained 
groups are quite small; evaluated by à 
two-way analysis of variance, F (1, 24) 
— 46. On the other hand, as one 
might expect from the differences 1n 
acquisition, Ss of Cond. A take e 
nificantly longer to reverse than Ss 0 
Cond. B, F = 5.24, p < .05. For the 
interaction term F = .30. A pod 
trend analysis of the first 7 reversa 
days showed none of the differences 
between the linear components to be 
significant, 


Discussion 


None of the variables investigated in 
the eight experiments has been ee 
in producing an ORE, nor is there oe 
indication that a particular combinati 
of these variables would prove success e 
The pattern observed repeatedly was à 
small, nonsignificant, difference betwee! 
the control and overtrained granai 
favoring one group as often as the other, 
frequently combined with a tendency 
for the reversal curve of the pui 
group to be slightly steeper than gs 
controls. This pattern is reflected » 
Part by the combined results of the eigh 
experiments, the control (N — 116) ee 
overtrained (N = 114) groups ayeme 

5 and 80.6 trials to acquisition x 
terion, respectively, and 134.3 and 141. 
trials to reversal criterion. 1l 

Obviously we have not explored @ 
possibilities: motive-incentive conditions, 
task difficulty, as well as a number A 
other possibly relevant variables were 
not investigated, and, of course, the sam! 
apparatus was used in all experiments: 
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But we believe that we have studied a 
sufficient number of variables to demon- 
strate that the ORE, even in visual-dis- 
Crimination studies at the rat level, is 
by no means a general phenomenon, a 
Conclusion bolstered by Erlebacher’s 
(1963) failure to find an ORE with con- 
ventional apparatus and hunger-moti- 
vated Ss. The conclusion seems war- 
ranted, therefore, that though the ORE 
may be a genuine phenomenon, the set 
of conditions under which it can be ex- 
Pected to occur must be quite specific and 
cannot be accurately delineated at the 
Present time. 

Specificity of results to highly re- 
Stricted sets of conditions is not, however, 
an affliction solely of the ORE. Judging 
from the inconsistencies of results and 
the failures to replicate so prevalent 
in contemporary psychological research 

€spite the reluctance on all sides to 
Publish “negative” results), the gen- 
Crality of the empirical relationships in 
Psychology may be more limited than 
Most of us now realize. If so, the con- 
tinued accumulation of isolated em- 
Pirical relationships is not likely to be of 
any great value in accelerating our under- 
Standing of behavior. Perhaps of greater 
Priority is the development of a set of 
Principles which will point up the im- 
Portant dimensions along which the 
Senerality (invariance) of interesting 
“mpirical relationships ought routinely 
di be evaluated. The problem is a 
areali ong but it must be ee E mE 

oid being buried under a grow1 
Pyramid of disjointed and conflicting 
up ime of data, highly restricted to 
ifiable sets of conditions. 
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A distribution function representi 
was derived, assuming RT is the 
exponential and normal distributi 
auditory stimulus following each oi 
tions: (a) foreperiod constant wit 
and (b) foreperiods appearing in a 
tribution function 
all 32 RT distributions 
function to foreperiod suggest th 
foreperiod is due to variation in 


Concern with the form of distribu- 
tions of reaction times (RT) has been 
focused largely on the fact that the 
nonnormal character of these distribu- 
tions tends to preclude application of 
the more common statistics in studies 
using RT as a dependent variable. 
Some attempts have been made 
recently, however, to relate certain 
properties of these distributions to 
hypothetical components of the ob- 
served response times. 

Restle (1961), for ex 
ing an almost constant 10 to 1 ratio 
between the means and standard 
deviations of 15 distributions of RT 
(data from Chocolle, 1940), inferred 
that the gamma distribution function 
might describe these distributions, 
This would imply that an observed 
RT must consist of the sum of a fixed 
number of independent component 
times, each of which is a random 
variable with a common exponential 
distribution. From the observed ratio 
of means and standard deviations in 
Chocolle's data, Restle further in. 
ferred that the number of components 
must be on the order of 100. Unfor- 
tunately, as McGill (1963) has pointed 
out, this interesting result does not 
-orrespond to commonly observed 
corres] According to the central-limit 


ample, obsery- 


data. 


ng simple reaction-time distributions 
sum of 2 component variables with 
ons. 4 Ss each gave 100 RTs to an 
f4 foreperiods, under each of 2 condi- 
hin sessions but varied over sessions, 
random sequence. The derived dis- 


provided completely satisfactory representations of 
; and the relations of th 


€ fitted parameters of this 


at the variation of RT asa function of 
the normally 


distributed component. 


theorem, a gamma distribution with E 
parameter indicating more than, say 
10 or 15 components would be very 
nearly normal. But observed dis- 
tributions of simple RTs obtaine 
under constant conditions seem in- 
evitably to be positively skewed. A 

McGill (1963) observed that the 
upper portions of plots of i, 
cumulative distributions of simple R 
to different auditory stimulus inr 
sities all had the form of exponentia 
distribution functions withsimilar time 
constants. This suggested that x 
least one component of the total R 1 
was exponentially distributed; ate 
since the time constants implied by 
the curves seemed to be nearly inde 
pendent of stimulus intensity, McGi 
assumed that this component is the 
time required to make the — 
motor response. Furthermore, sinc 
the overall distributions clearly pe 
with intensity of the stimuli, it ee 
assumed that the RTs per 
a nonexponential component owe 
might be interpreted as “decisio 
time.” 

As part of a model intended to 
represent the structure of ec 
RT, Christie and Luce (1956) pue 
that an observed response de 
composed of an exponentially distri 
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uted choice or decision time plus a 
variable "base" time or motor re- 
sponse time, the distribution of which 
was not specified. Both McGill and 
Christie and Luce thus assume that 
RT distributions result from convolu- 
tions of distributions of two compo- 
nent random variables, one of which 
has the exponential distribution. 
Both also hypothesize that the two 
components represent decision time 
and a residual latency which is pri- 
marily motor response time. But here 
the agreement ends: the two papers 
present directly opposite interpreta- 
tions of the exponential component. 
McGill (1963) assumed it is the “move- 
ment response [p. 329]," whereas 
Christie and Luce (1956) assumed it 
to be “decision latency [p. 25)" 
Clearly, interpretation of the com- 
Ponent processes underlying reaction 
latencies would be facilitated by 
knowledge of a distribution function 
Which accurately describes distribu- 
tions of total observed reaction times. 
‘he aim of the present study is to 
Provide evidence for such a function. 
First of all, it is known that the 
distribution of the sum of a large 
number of random variables is asymp- 
totically normal regardless of the 
distributions of the component vari- 


Strictly speaking, Equation 1 is 
an approximation: its derivation en- 
tailed the restriction of both f and fe 
to Positive values whereas the assump- 
tion of normality for fe requires that 
this variable can take any real value. 

he approximation is very close, how- 
€ver, as long as the ratio of the mean 
to the standard deviation of the 
Normal variable is of an order of 3 or 
More, 

Equation 1 provided good repre- 
Sentations of 12 distributions of RT 


1 e "ü y [E 
dim] n e-zdy|e-^s tz 
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ables, provided no one component 
contributes disproportionately to the 
variance of the sum (Cramer, 1946); 
therefore, if it is assumed that an RT 
is the sum of time intervals to com- 
plete a sequence of separate processes, 
then the distribution of this sum 
would necessarily approach normality 
unless one (or some very small num- 
ber) of the components contribute 
substantially more variance than any 
of the others. 

The simplest hypothesis seems to be 
that an observed RT results from 
summation of (a) a single dominant 
exponentially distributed variable as 
proposed by McGill (1963) and by 
Christie and Luce (1956), and (b) a 
normally distributed variable which 
may itself be the sum of a number 
of component variables with similar 
variances. 

To test this hypothesis the proba- 
bility density and cumulative distribu- 
tion functions are needed for the 
variable! = tı + £s, where !; and fy are 
the exponentially and normally dis- 
tributed variables. Lettinga = E(t), 
b = E(t), and c = ø (t2), and assuming 
i1, and t: are independent, the cumu- 
lative distribution function for £ was 
determined to be 

[-b_¢ 


[1] 


c a 


to onset of low-intensity visual stimuli 
(.05 and .125 footlambert (ítL.) 
against a 1.8-ftL. background)! In 
these data the fitted parameter a was 
markedly different for the two in- 
tensities, but the fitted b was not 
related to this variable. A tentative 
interpretation of these results was 
that RTs do contain an exponentially 
distributed component and that this 

! Data. presented at the Midwestern Psy- 


chological Association meetings at Chicago, 
May 1963. 
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component is the decision or “percep- 
tion” portion of an RT. The remain- 
ing portion, the mean of which is 
represented by the parameter b, was 
assumed to be the time required for 
organization and execution of the 
motor response. 

The present study was designed to 
provide a further test of Equation 1 
and to attempt to manipulate the 
normally distributed component of 
simple RTs. It has been generally 
assumed that effects of different fore- 
period durations on simple RT are due 
to changes in 5's preparatory set dur- 
ing the prestimulus delay. And evi- 
dence seems to indicate that, for 
clearly suprathreshold stimuli, the 
preparatory set is primarily muscular 
(Teichner, 1954). If this is indeed the 
case, and if the assumption is correct 
that a normally distributed compo- 
nent of RT includes motor response 
time, then it should be possible to 
account for variation in RT due to 
foreperiod in terms “of variation of 
the 6 parameter in Equation 1 as a 
function of foreperiod. 


METHOD 


Distributions of simple RT to an auditory 
stimulus were obtained with four different 
foreperiods under two conditions: (a) con- 
stant foreperiod within sessions, and (b) ran- 
dom sequences of foreperiods within sessions. 

Each trial began with onset of four panel 
lights surrounding a 4-in. speaker. After a 
delay (foreperiod) of 1.60, 3.20, 4.65, or 6.13 
sec., a 62-db. (re .0002 dynes/cm?), 1,000- 
cycle tone appeared from the speaker, which 
was S’s signal to press a microswitch button 
as quickly as possible. The response ter- 
minated both the lights and the tone. There 
was a constant 1-sec. interval between S's 

response and the beginning of the next trial. 
Ambient background noise in the experi- 
mental room was 35 db. " 
Four 14-yr.-old female Ss each participated 
in a practice session followed by eight 130- 
trial sessions. Only the last 100 trials from a 
session were retained for analysis, the first "x 
being treated as warm-up trials. E. eig E 
sessions were spread over 10 days with no < 


participating in more than one session on any 


one day. 


During the first four sessions the fore- 
periods were presented in a random sequence 
for all Ss; during the last four the foreperiod 
was constant within a session but varied over 
sessions. Two Ss received the constant fore- 
periods in descending order, the other two in 
ascending order. A distribution of 100 re- 
sponses was thus obtained from each S for 
each foreperiod under each of the two fore- 
period conditions. 

For each of the 32 distributions, 10 inter- 
vals of response times were selected such that 
10% of the observations fell in each interval. 
Parameters in Equation 1 were then deter- 
mined by successive approximations such 
that this distribution function assigned as 
nearly as possible 10% of the probability to 
each of the 10 intervals. The criterion of 
closeness of fit was a chi-square statistic based 
on deviations of observed from expected 
frequencies in the 10 intervals, 


RESULTS AND CONCLUSIONS 


The correspondence between Equa- 
tion 1 and the four Ss' distributions of 
RTs following the shortest foreperiod 
used is indicated in Fig. 1. The 
upper four distributions are those ob- 
tained with this foreperiod occurring 
in a random sequence; the lower four 
were obtained with the foreperiod 
constant. That this generally excel- 
lent correspondence was uniform over 
the 32 sets of data is indicated by the 
distribution of the goodness-of-fit 
statistic: Under the null hypothesis— 
ie. the hypothesis of perfect fit of 
Equation 1 except for random sam- 
pling error—this statistic would be 
distributed as a chi-square variable 
with df = 6. Thus ifa perfect fit is 
assumed, the expected value of the 
Statistic for any one distribution 
would be 6, and the expected value 
of the sum of the 32 (independent) 
Statistics would be (32)(6) = 192. 
Obtained values of the 32 statistics 
ranged from .65 to 11.85, and their 
Sum was 172.17. 'The median value 
was 4.92 which may be compared to 
an expected median of 5.35 on the 
basis of random sampling. 

Figure 2 shows the obtained rela- 
tionships of mean RT and its inferred 
components to foreperiod duration 


-— — 


COMPONENTS OF RT AND THE FOREPERIOD 385 


1.00 1.00 

80} 80 
> 60 60 
5 ; 
z 
ü 
2 40 40[ 
e 
z 
w .20 .20 
w 
= o o 
E 
< 
- 
Ù 1.00 1.00 
a 
V eo} 60 
A fo 
sco} CW-R 60 SB-R 
3 
Oo 40} Ao 

20 + 20 

o —— L — 
o 200 250 300 350 400 450 


200 250 300 350 400 450 


CUMULATIVE RELATIVE FREQUENCY 


1.00 1.00 
80 80 t 
m | 60 i SB-C 
dil: 40 H 
| «| 
o li L — 1 
ou "Lo 200 250 300 350 150 200 250 300 350 


RT IN MILLISECONDS 


d to the four Ss’ cumulative distributions of RTs to a 62-db., 1,000- 
f 1.60 sec. (The plotted points are the second, fourth, sixth, 
d ninety-sixth percentiles, and every fifth percentile from the 


a l. Equation 1 fitte 

ninet: tone with a foreperiod o 

itke ona, ninety-fourth, anı 
to the ninetieth.) 


386 
een ds CONSTANT FRPRD 
NE ic ce---5 RANDOM FRPRD 
3 240 
E230 
& 
z 220 
Ei 
220 
200 
t 2 3 4 5 6 
229) *— —5 CONSTANT FRPRD 
S 210 Seo œ--- RANDOM FRPRD 
E] Seg 
= 200 ees b 
2 
m 190 
5 
wW 180 
z 
& 170 
= n n 
f 2 3 4 5 6 
3 eo 1 
2 *— —5 CONSTANT FRPRD 
= so e---9 RANDOM FRPRD 
e aof = ==- uL 
= 30 
Shy en, à 
= 2 3 4 NC 
FOREPERIOD (SEC.) 
Fic. 2. Relations of mean RT and its in- 


ferred components to foreperiod duration. 
(The parameter a is the mean of a hypothet- 
ical exponentially distributed component; b is 
the mean of a hypothetical normally dis- 
tributed component.) 


under the two conditions. The mean 
R'T curves are very similar to those 
reported in other studies (e.g., Karlin, 
1959) ; the shortest foreperiod appears 
to have]been disruptive when it oc- 
curred in a random sequence including 
longer foreperiods, but resulted in the 
shortest RTs when it was constant 
from trial to trial. Figure 2 suggests 
that most of this effect can be ac- 
counted for as differential variation 
of b, the normal component, with 
foreperiod under the two conditions. 
A Foreperiod X Constant-vs.-Ran- 
dom X Ss analysis of variance for 
mean RT and for each of the fitted 
parameters indicated that (a) the 
Foreperiod X Constant vs. Random 
interaction effect on mean RT was 
significant at the .05 level, F (3,9) 
= 6.61; (b) the corresponding ob- 
tained F for the b parameter Ju rd 
which is very close to the critical va ue 
3.86) required for significance at the 
Cs level; while (c) this E 
effect on the parameter a did not ap 
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proach significance (F = 1.46); and 
(d) although the parameter c, inter- 
preted as the standard deviation of 
the normally distributed component, 
showed considerable variation among 
Ss, its variations due to Foreperiod 
and Constant vs. Random conditions 
were not statistically significant. The 
range of estimated c's over the 32 sets 
of data was from 10 to 42 msec. 

The distributions of RTs to low- 
intensity visual stimuli mentioned 
earlier and the auditory RT data re- 
ported here were thus both consistent 
with the hypothesis that an observed 
distribution of simple RTs may be 
described by a convolution of a 
normal and an exponential distribu- 
tion. Moreover, the apparent differ- 
ential dependence of the means of 
these inferred component distribu- 
tions on stimulus intensity and fore- 
period duration strongly suggests that 
the component distributions are in 
fact associated with times to complete 
different processes underlying ob- 
served RTs. 
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The effects of list differentiation on retention following massed and 
distributed practice were evaluated by measuring Ist-list retention, or 
RI, in an A-B, A-C situation. A-C learning was either by massed or 
distributed practice. A-B and A-C learning were either by the same 
or different practice methods. 1st-list recall was better when A-B and 
A-C were learned by different procedures than when they were learned 
by the same procedures. 1st-list recall following distributed A-C learn- 
ing did not differ from Ist-list recall following massed A-C learning. 


Underwood, Keppel, and Schulz 
(1962) have attributed the facilitative 
effects of distributed practice (DP) 
to a successive extinction-recovery 
Process whereby interfering associa- 
tions are more effectively and perma- 
nently extinguished during distributed 
learning than during massed learning. 
In discussing their finding that A-E 
recall in an A-B, A-C, A-D, A-E 
situation was greater following dis- 
tributed A-E learning than following 
Massed A-E learning, these authors 
Suggested that the successive A-E 
distributed practice intervals allowed 
for greater recovery and therefore 
More effective extinction of the asso- 
Ciations established between the con- 
text stimuli and the responses of the 
first three lists during A-E learning. 

hus, at the time of A-E recall 24 hr. 
after A-E learning, proactive inhibi- 
tion (PI) associated with the inter- 
ering context associations would have 
been weaker following distributed A-E 
farning than following massed A-E 

This arried out during the 
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learning, allowing the facilitative 
effect to appear. 

There is another process, involving 
list differentiation, which could con- 
tribute to the facilitative effects of 
DP. In the Underwood, Keppel, and 
Schulz experiment all Ss learned the 
first three lists by massed practice 
(MP). A DP group then learned the 
A-E list by DP while an MP group 
learned A-E by MP. Thus, the MP 
group learned all lists by MP while 
the DP group learned the A-E list by 
a procedure which differed from the 
conditions under which the first three 
lists were learned. The superior A-E 
recall following DP may have been 
due, in part, to the DP Ss’ ability to 
differentiate the lists on the basis of 
the difference between the practice 
methods employed in learning the 
lists. The MP Ss, because they 
learned all lists by MP, would be 
subject to a greater degree of list 
confusion than would the DP Ss. The 
implication is that the superior A-E 
recall demonstrated by the DP group 
may have been a function of both 
differential extinction-recovery and 
list differentiation. ‘ 

The present experiment was de- 
signed to evaluate the list differentia- 
tion interpretation of the DP effect by 
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measuring retention of first-list asso- 
ciations 1 min. after A-B, A-C 
learning. This retroactive inhibition 
(RI) paradigm was employed because 
it was thought that RI would be a 
more direct measure of the effects of 
DP and MP on previously learned 
associations than would a PI measure 
as has been used in previous studies. 
The main effects of the 2 X 2 factorial 
design were massed vs. distributed 
A-C practice and same vs. different 
practice procedures employed in learn- 
ing the A-B and A-C lists. The four 
main conditions were massed A-B 
learning followed by massed A-C 
learning (MP-MP), massed A-B fol- 
lowed by distributed A-C (MP-DP), 
distributed A-B followed by dis- 
tributed A-C (DP-DP), and dis- 
tributed A-B followed by massed A-C 
(DP-MP). The list-differentiation 
hypothesis predicts that first-list re- 
tention will be greater in the two 
paradigms involving different A-B and 
A-C practice procedures (MP-DP, 
DP-MP) than in the two conditions 
involving the same A-B and A-C prac- 
tice procedures (MP-MP, DP-DP). 
A prediction with respect to the ex- 
tinction-recovery hypothesis is more 
difficult because there has been no 
clear statement as to whether DP is 
expected to result in a lower initial 
level of extinction as well as slower 
recovery. If it is assumed that the 
extinction-recovery process implies a 
greater loss of Context B associations 
during DP than during MP then it 
may be predicted that retention 
of first-list associations in the two 
distributed A-C learning conditions 
(MP-D P, DP-DP) will be less than 
the first-list retention in the two 
A-C conditions (MP-MP, 


massed 
DP-MP). 
METHOD 
General. —All Ss learned two successive 


paired-associate lists conforming to an A-B, 


A-C paradigm. Depending upon the list and 
the condition, learning was either by MP or 
DP. The Ss were given nine anticipation 
trials on both the A-B and the A-C lists. 
Distributed intervals were filled with a digit- 
cancellation task. Following A-C learning 
all Ss were given a recall trial and four addi- 
tional relearning trials on the A-B list to 
evaluate RI. It was assumed that differences 
in the Context B associations would be re- 
flected in the recall of specific A-B associations 
because the context association is a prerequi- 
site of the specific association; i.e., a response 
must be available if it is to be associated with 
a specific stimulus. . 
The general design called for four practice 
sequences (MP-MP, DP-DP, MP-DP, DP- 
MP). The variations in second-list practice 
procedures presented a problem in that 10 
distributed A-C trials take longer than 10 
massed A-C trials, making the interval be- 
tween the end of A-B learning and recal 
longer for the MP-DP and the DP-DP condi- 
tions than for the MP-MP and DP-MP con- 
ditions. This problem was eliminated, and 
the time between A-B learning and reca 
equated for all conditions, by having the 
MP-MP and DP-MP groups cancel digits for 
9 min. (the amount of time the DP-DP an 
MP-DP groups canceled digits during dis- 
tributed A-C learning) either just before OF 
after A-C learning. This was accomplishe 
by splitting the MP-MP and DP-MP condi- 
tions into two groups each. The MP-C-MP 
and the DP-C-MP groups canceled digits be- 
tween A-B and A-C learning while the MP- 
MP-C and the DP-MP-C groups cancele 
digits between A-C learning and recall. Thus 
the complete design, including the groups 
developed to account for the differences !^ 
time required to learn by MP and DP, ja 
quired six groups (MP-DP, DP-DP, DP-C 
MP, DP-MP-C, MP-C-MP, MP-MP-C). ; 
Subjects—The Ss were 90 University © 
California undergraduates, 15 per grouP' 
whose participation was in fulfillment of a 
course requirement. The first 45 Ss receive 
MP on the A-B list and were random y 
assigned to one of the three A-C conditions 
(DP, MP-C, C-MP) as they appeared for the 
experiment. The remaining Ss received 
on the A-B list and were randomly assigne 
to the A-C conditions. TT 
Lists.—The A-B list for all Ss was x 
in Table 1 of the Underwood and Schutz 
(1961) experiment. The stimulus terms 
were nonsense syllables and the response 
terms two-syllable adjectives. The A-B am 
A-C stimulus terms were identical. The A- 
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response terms were the adjectives from List 2 
of Table 1 (Underwood & Schulz, 1961). 
Procedure,—Lists were presented at a 2:2- 
Sec. rate on a Stowe drum. Both A-B and 
-C lists were presented for nine anticipation 
trials. Four different orders of the items In 
each list were used to avoid serial learning. 
Each order was used equally often as the 
Starting order. The MP intertrial interval 
I 4 sec. and the DP interval 1 min. The 
ti P intervals were filled with a digit-cancella- 
ton task. Approximately 30 sec. separated 
e learning of the two lists except 1n those 
Cases where an additional 9 min. of digit 
Cancellation was introduced before or after 
Massed A.C practice. The recall trial, ad- 
ministered 1 min, after the end of the A-C 
\me period, was a 2:2-sec. anticipation trial. 
elearning was then carried for an additional 
ee trials. Prior to recall and relearning Ss 
vere instructed to recall as many of the first- 
‘St responses as possible on the recall trial. 


RESULTS 
" Acquisition,—The acquisition and 
meting curves for all six exper- 
'éntal treatments are shown in Fig. 1. 
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and relearning curves for the six conditions. 


The mean numbers of correct re- 
sponses over the nine A-B anticipation 
trials for the MP-DP, MP-MP-C, and 
MP-C-MP conditions were 49.47, 
53.87, and 49.07, respectively. Mean 
correct first-list responses for the 
DP-DP, DP-MP-C, and DP-C-MP 
groups were 46.87, 47.60, and 47.40, 
respectively. In order to compare 
first-list performance of groups which 
later received the MP-C, C-MP, or 
DP conditions on the second list, and 
to compare A-B performance of the 
groups receiving the same or different 
List 2 practice conditions, the correct 
responses on the first list were evalu- 
ated in a 3 X 2 analysis of variance. 
'The obtained F values were less than 
1.00 for the two levels of differentia- 
tion, the three levels of second-list 
practice, and the interaction, implying 
that the groups comprising the various 
experimental conditions did not differ 
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significantly in terms of  first-list 
learning. 

The mean numbers of correct re- 

sponses over the nine A-C anticipation 
trials were 54.33, 56.67, 51.67, 47.13, 
55.73, and 55.33 for the MP-DP, 
MP-MP-C, MP-C-MP, DP-DP, DP- 
MP-C, and DP-C-MP conditions, re- 
spectively. These second-list data 
were also compared in an analysis of 
variance based on the two levels of 
differentiation and the three levels of 
second-list practice. Although there 
was a tendency for groups receiving 
different practice conditions on List 2 
to learn the second list better than 
groups receiving the same practice 
conditions on both lists, as might be 
predicted from the differentiation hy- 
pothesis, the difference was not sig- 
nificant, F (1, 84) = 1.60; p> .05. 
The second-list performance for groups 
receiving the DP practice condition on 
List 2 was somewhat below the groups 
receiving both the MP-C and the 
C-MP treatments, but this difference 
was also not significant, F (2, 84) 
= 1.48; p > .05. The latter result is 
consistent with the Underwood, Kep- 
pel, and Schulz (1962) finding that DP 
had a slight but nonsignificant effect 
upon A-E learning in the A-B, A-C, 
A-D, A-E situation. The interaction 
was not significant, F (2, 84) = 1.10, 
p > .05. The results of the analysis 
indicate that the experimental condi- 
tions had no significant effects upon 
acquisition of the second list. 

A-B responses appeared as intru- 
sions in the A-C learning of the 
DP-DP, MP-MP-C, MP-C-MP, DP- 
MP-C, and DP-C-MP conditions six, 
five, six, four, and five times, respec- 
Gvelv, The MP-DP group gave 17 
tively. " * 

a ions but 8 of these were pro- 
ae by one S. The low numbers of 
wane : lus the fact that one S's 
intrusions, ae for the major por- 


nses ac ea > 
respo between-conditions vari- 


tion of the 
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ance, suggest that little should be 
made of these intrusion data. 

Recall and relearning.—Table 1 con- 
tains the recall and relearning means 
and SDs for the six cells of the fac- 
torial design. The differentiation 
hypothesis predicts greater first-list 
retention for groups receiving differing 
treatments on the A-B and A-C lists, 
regardless of whether A-C was learned 
by DP or MP, because these groups 
had an Opportunity to differentiate 
the two lists on the basis of the 
different practice proceduresemployed 
during learning. The present inter- 
pretation of the extinction-recovery 
hypothesis implies poorer first-list re- 
tention (greater RI) following dis- 
tributed A-C learning, regardless of 
the A-B practice conditions. To test 
these predictions the recall and re- 
learning measures contained in Table 
1 were compared in 3 X 2 factorial 
analyses of variance containing the 
three levels of second-list practice 
(MP-C, C-MP, DP) and the two 
levels of list differentiation (same, 
different). " 

Analysis of the recall scores indi- 
cated no significant difference among 
the three A-C practice conditions, 


TABLE 1 
MEAN Numbers or A.B RESPONSES 
CORRECTLY REPRODUCED IN 
RECALL AND RELEARNING 


A-B, A-C Differentiation Cond. 
A-C Practice Same Different 
M SD | M 2m 
MP-C 
Recall 4.20 | 1.94 | 5.13 Be 
elearning | 32.80 | 5.52 | 31,80 | 5.1 
C-MP 
Recall 3.27 | 247 | 4.67 | 1.89 
pptelearning 29.80 | 6.24 | 32.00 | 7.1 
Recall 3.60 | 2.53 | 4.93 | 2.02 
Relearning 29.07 | 7.47 | 33.40 | 5.64 


LIST DIFF 


F (2, 84) < 1.00. The list-differen- 
tiation effect was significant, F (1, 84) 
= 6.80, p < .05, with groups receiv- 
ing different types of practice on the 
first and second lists showing better 
A-B recall than those receiving the 
same practice procedure on both lists. 
The interaction was not significant, 
F (2, 84) « 1.00. 

The Ss receiving the same practice 
conditions on both lists produced 17 
A-C intrusions in A-B recall while Ss 
receiving different practice conditions 
gave 7 intrusions. While the numbers 
are small, and little importance should 
be attached to the result, these data 
do conform to the list-differentiation 
expectation that there will be fewer 
A-C intrusions in A-B recall when 
A-B and A-C practice conditions are 
different than when they are the same. 

The analysis of variance performed 
on the relearning data showed no 
Significant differences for either the 
list differentiation, F (1,84) < 1.00, 
9r the second-list practice variables, 
F (2,84) < 1.00. The interaction was 
Not significant, F (2, 84) < 1.00. The 
relearning curves plotted in Fig. 1 
illustrate that the list-differentiation 
effect evident on the recall trial dis- 
appeared quickly during succeeding 
relearning trials. . , 

he numbers of A-C intrusions 1n 

for the same and 
different conditions during the five 
recall and relearning trials were 39 
and 21, respectively. While the num- 
rs were again very small they did 

Conform to the expectation, derived 
Tom the list-differentiation interpre- 
tation, that the same practice proce- 
dures would lead to a greater number 
intrusions than would different 


Pract 
Practice procedures. 


relearning 


DiscussION 


Principal findings of the expert 


The 
ere as follows. When Ss learned 


men tw 
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two lists in an A-B, A-C situation, with 
either MP or DP on both the first and the 
second list, recall of the A-B list following 
A-C learning was poorer than if Ss 
learned the first and second lists by 
different practice procedures (either DP 
of A-B and MP of A-C or vice versa). 
When A-B practice conditions were 
counterbalanced, A-B recall following 
distributed A-C learning did not differ 
from A-B recall following massed A-C 
learning. The implications of these find- 
ings for the extinction-recovery and the 
list-differentiation hypotheses presented 
in the introduction will be discussed 
separately. 

According to the present interpretation 
of the extinction-recovery hypothesis, B 
recall should be poorer following dis- 
tributed A-C learning than following 
massed A-C learning, regardless of the 
conditions of prior practice. The lack of 
a DP effect may be interpreted in several 
ways. First, it could conceivably reflect 
the fact that DP may not, by itself, lead 
to more effective extinction of prior 
associations in a transfer situation. 
However, this interpretation cannot be 
seriously considered on the basis of this 
one study which failed to find DP effects 
with an RI measure. A more likely inter- 
pretation is that MP and DP lead to 
different rates of B recovery over ex- 
tended periods of time but not, as has 
been assumed in this paper, to different 
levels of B strength immediately follow- 
ing A-C learning. According to this 
interpretation, which is perhaps the in- 
terpretation intended by Underwood, 
Keppel, and Schulz, both MP and DP 
extinguish first-list responses to the same 
level but result in different rates of re- 
covery. Thus, the present experiment, 
measuring retention 1 min. after A-C 
learning, may not have allowed sufficient 
time for differential recovery to occur and 
was, therefore, an inadequate test of the 
hypothesis. A more appropriate test 
would involve first-list retention after a 
24-hr. retention interval with list differ- 
entiation controlled, A third explanation 
of the failure to find a significant second- 
list practice effect would hold that a 
more appropriate test of the availability 
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under eight different exposure durations and 
eight luminances, by the method of constant 
stimuli. Exposure durations varied from 4 
msec. to 512 msec., in logs steps. Luminance 
was varied in logs steps from .09 mL. to 11.84 
mL. The fixation field luminance was always 
equal to that of the stimulus field. Within a 
session each of the eight luminances was 
constant for a block of 16 exposures (every 
exposure duration for each pattern), for a 
total of 128 exposures for each session. The 
order of luminance blocks was randomized, 
with the restriction that the initial block was 
always in the brighter half of the range and 
that successive blocks were within two steps 
of each other.  Pattern-exposure-duration 
combinations were randomized within each 
block, with each combination used only once 
within a block. 

Approximately .5 sec. before each exposure 
a warning signal was given with a manually 
operated buzzer, and .S focused on the space 
between two horizontal edge markers on the 
fixation field. To respond, S was asked to hit, 
as quickly as possible, one of two microswitch 
keys located on either side of a finger rest. 
For three of the Ss, the key on the right side 
was associated with the evenly spaced designs; 
for two of the Ss this key was associated with 
the paired designs. The designs were not 
described verbally, either by S or by E, until 
the end of the study. If S saw absolutely no 
indication of a design during the exposure 
(i.e, not even a blurred line), he was in- 
structed to hit either key, and then say the 
word "nothing." There was an interval of 
approximately 11 sec. between exposures 
within blocks of exposures and 45 sec, between 
each block, when intensity filters were 
changed. 

Subjects—The Ss were five male college 
seniors who were paid for their Participation. 
None had previous experience in perception 
experiments. 


Results 


Figure 1a shows the mean RT for 
each pattern as a function of exposure 
duration, averaged across all lumi- 
nances. Figure 1b shows the mean RT 
for each pattern at every luminance, 
averaged across all exposure dura- 
tions. Each point in Fig. 1a and 1b is 
based on correct resporises from See 
of 450 exposures. Only the T Ao 
correct responses were selected in the 

ief that these would most direct y 
e differences in discrimination 
difficulty of these patterns. 


Figures 1a and 1b show that, in 
general, R'Ts become faster with in- 
creases in duration and luminance, but 
exceptions occur at the longest dura- 
tion and brightest luminance where 
there is some tendency for RT to 
become slower again. 

Figures 1a and 1b also indicate an 
RT ordering of patterns which is 
similar to the ordering obtained for 
accuracy of identification (Kaswan & 
Young, 1963). With both accuracy 
and RT measures, the obtained order- 
ing from lowest to highest accuracy 
and fastest to slowest RT was Pri 
Prə< Pr; and Ev; < Ev.< Ev;. Also, 
as for accuracy, RT to paired patterns 
tended to be slower than the RTs to 
comparable Ev patterns, except for 
the Ev;-Pr; set, where RT to the two 
patterns was about the same, probably 
because these patterns were easily 
discriminated. ; 

While RT reflects the order in which 
these patterns were accurately dis- 
criminated, variations in luminance 
appear to affect RT and accuracy 
differently. For accuracy, it was 
found that discrimination of the pat- 
terns depended largely on exposure 
duration and that variation of lumi- 
nance had little effect on the level of 
accuracy. Figure 1b shows, however, 
that RT was consistently and sub- 
stantially affected by variations in 
luminance. 

While Fig. 1a and 1b show the 
"main effects" of exposure duration 
and luminance on RT, Fig. 2 shows 
how these variables interact in their 
effect on RT. The parts of Fig. 2 
labeled a through h contain the results 
of the experiment described above. 
The parts of the figure labeled a 
through h’ report the results of the 
supplementary study which will be 
described later. 

Each point in Fig. 2a through 2h is 
a mean RT based on 340 responses; 
the result of combining correct, in- 
correct, and “nothing” responses tO 


MULUS AND TASK 


la 


MEAN REACTION TIME (ms.) 


4 16 
EXPOSURE DURATION (ms.) 
Fic. 1. Mean reactior 


a j 
a Aged over all luminance 
averaged over all exposure-C 


values for the main 
luration values. 


all six stimuli by the five Ss. There 
Was close similarity in the form of the 
curves for each pattern and each type 
8 response, although incorrect and 

nothing” responses Were slower and 
More variable than correct responses. 
"igure 2 is therefore quite representa- 
tive of the results. Vertical bars at 
representative points in each curve, 
indicating the standard error at each 
9f these means, show that variability 
is small relative to the range of RT 
effects obtained as a function of lumi- 
"ance and duration. Only mean RTs 
Ky uh he Iret 

1 are included. The fr 


| time for each patt 
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py 
ar 148 5.92 


INTENSITY (mL.) 


ern as a function of (a) exposure duration 
experiment and as a function of (1b) luminance 


sessions were eliminated because RT 
decreased over these sessions. During 
the last two sessions, RT became very 
variable, perhaps because Ss antici- 
pated the end of the experiment. 
Each of the solid curves in Fig. 2 
shows the effect of exposure duration 
on RT under a different luminance 
parameter. The exposure-duration 
curve for the brightest parameter 
(11.84 mL.) is shown as the solid line 
in Fig. 2a. Luminance values then 
become successively dimmer, with the 
solid curve under the dimmest param- 
eter (.09 mL.) shown in Fig. 2h. 
Solid curves are to be read against the 
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STIMULUS AND TASK COMPLENITY AND RT 


exposure-duration scale shown on the 
abscissa (labeled “ms.”) in each 
part of Fig. 2. lt is apparent that 
the exposure-duration-R T function 
changes with the level of luminance. 
As luminance parameters become 
dimmer (solid lines, Fig. 2a to 2h), the 
slowest RT occurs at increasingly 
longer exposure durations. Thus, the 
slowest RT occurred at the 4-msec. 
exposure under the 11.84-mL. lumi- 
nance parameter (Fig. 2a), at the 
8-msec. exposure under the 1.48-mL. 
Parameter (Fig. 2d), at about 16 
msec. under the .74-mL. parameter 
(Fig. 2e), at 32 msec. under the 
-18-mL. parameter (Fig. 2g), and at 
64 msec. under the .09-mL. parameter 
(Fig. 2h). This shift in the exposure 
duration for which the slowest RT 
Occurred leads to a gradual inversion 
of the slope in the RT-exposure-dura- 
tion function. At one extreme, under 
maximum luminance (11.84 mL., Fig. 
2a) RT became faster between 4 and 
16 msec. At the other extreme, under 
Minimum luminance (.09 mL., Fig. 
2h) RT became slower between 4 and 
msec, Thus the slope of the RT 
Unction in Fig. 2h is in the opposite 
‘rection to the slopes shown 1n Fig. 

2a, 2b, 2c, and 2d. . 
. The curves drawn with dashed lines 
in Fig, 2 show the effect of luminance 
9n RT at each exposure-duration pa- 
rameter, For example, the dashed 
Curve in Fig, 2a shows the eflect of 
"minance on RT for the 512-msec. 
Parameter, Exposure-duration pa- 
rameters (shown next to each curve) 
then become successively briefer, with 
the briefest (4 msec.) for the dashed 
ine in Fig 2h, These curves are to be 
against the luminance scale 
(labeled “mL.”) shown below the ex- 
Posure-duration scale in each part of 

1g. 2. 

The scales are arranged so that 
Points directly above and below each 
€r on the dashed and solid curves 
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represent equal luminance-exposure- 
duration products. There is a tend- 
ency toward superimposition of the 
dashed and solid curves, indicating 
that changes in luminance and expo- 
sure duration generally have a similar 
effect on RT under these conditions. 
Figures 2d and 2e indicate, however, 
that the shift toward a reversal in the 
direction of the RT slope appears to 
occur a little later in the luminance 
curves (dashed lines) than in the 
exposure-duration curves (solid lines). 

Since the shift in the direction of 
the slope of these curves occurred 
gradually with decreasing parameter 
values for both luminance and expo- 
sure duration, these results appear to 
constitute a reliable finding. 

Figures 2a through 2h show that 
the present results differ in two major 
ways from RT findings in flash detec- 
tion. One difference is that at low 
luminances (e.g., solid line, Fig. 2h) 
RT becomes slower as exposure dura- 
tion increases. The same effect was 
obtained for increases in luminance 
under brief exposure durations (e.g., 
dashed line, Fig. 2h). At dim and 
brief parameter values previous stud- 
ies found their most substantial RT 
variations in the opposite direction 
from thosereported here. Thatis, RT 
tended to become faster with increas- 
ing exposure duration or increasing 
luminance, given, respectively, dim 
or brief parameter values. 

A second difference between these 
results and previous findings relates 
to the difference in the size of the 
range of exposure durations and lumi- 
nances affecting RT. As noted, Raab 
and Fehrer (1962) found that even at 
their dimmest luminance parameter 
(.3 ftL.), exposure durations longer 
than 20 or 25 msec. had little effect on 
RT, whereas the range of effective 
luminance values was much wider (up 
to 3,000 ftL.). Figures 2a through 2h 
show, on the other hand, that expo- 
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changes at bright luminances, since 
both their data, and ours, indicate 
that, as luminance levels increase, 
greater increments in absolute bright- 
ness seem necessary to produce equiv- 
alent RT changes. 


DISCUSSION 


The substantial similarity of the Raab 
and Fehrer findings and the results of our 
supplementary study indicate that the 
luminance conditions used (i.e., flash or 
constant illumination of fixation and ex- 
posure field) make little difference in how 
luminance and exposure duration affect 
RT. The similarity in the results of 
these studies is especially striking in view 
of the differences in apparatus and pro- 
cedures used. 

The results further suggest that the 
range of exposure durations affecting RT 
in a detection task is similar to the range 
affecting detection thresholds. Thus, in 
the supplementary study, exposure dura- 
tions longer than 16 msec. did not sub- 
stantially affect RT (Fig. 2a’ through 
2h’), Similarly, in a previous study, 
probability of detecting a single large dot 
under dim luminance (.09 mL.) was most 
affected at exposures briefer than 16 
msec, (Kaswan & Young, 1963). 

The Raab et al. and our findings also 
indicate that RT in detection tasks is 
affected by a much wider range of lumi- 
nance than exposure-duration values, 
This suggests that the effect of luminance 
in effecting minimum RT is different 
from the effect of luminance on “high 
criterion” detection thresholds (near 
100% detection). The latter appears at 
least grossly determined by a reciprocal 
luminance-duration relationship (Bun- 
sen-Roscoe "law", whereas the minimum 
RT cannot be reached by increasing ex- 
posure duration to compensate for lower 
luminance (compare, for example, RTs 
to 300 ftL. and 3,000 ftL. flashes in the 

s). 
E aq a with the Raab and 
Fehrer findings also indicates that the 
zling results of our main study—the 
pepe in the direction of RT slopes at 


dim and bricf parameters and the ex- 
tended effect of exposure duration and 
luminance on RT—were probably due to 
the complexity of the discrimination task 
and do not depend on the way in which 
luminance and exposure duration were 
varied, 

In a previous study the authors sug- 
gested that luminance and exposure 
duration function differently in deter- 
mining Probability of detection and 
accuracy of pattern discrimination (Kas- 
wan & Young, 1963). The present 
findings suggest a similar conclusion with 
regard to reaction time, since luminance 
and exposure duration clearly generate 
different RT functions in detection, as 
compared to pattern discrimination (com- 
pare Fig. 2a through 2h with Fig. 2a 
through 2h^. The distinction between 
discrimination and detection may thus 
have more than semantic merit an 
Warrants further investigation. 


REFERENCES 


BaUxcanpr, E., & SEGAL, J. Facilitation et 
inhibition parametres de là fonction visuelle- 
Annee psychol., 1942, 43, 54-102. x 

FRoEBERG, H. The relation between magni- 
tude of the stimulus and the time of 25 
action. Arch, Psychol, N. Y., 1907, 25 
No. 8. 

Kaswan, J., & Youne, S. Stimulus exposure 
time, brightness, and spatial factors 85 
determinants of visual perception. J. exp. 
Psychol., 1963, 65, 113-123. f 

RAAB, D. & Fenrer, E, The effect © 
stimulus duration and luminance on e 
reaction time. J. exp, Psychol, 1962, 65 
326-327. 1 

Raan, D., FEHRER, E., & HERSHENSON, es 
Visual reaction time and the Broca-Sulze 
Phenomenon. J, exp. Psychol, 1961, 6^; 
193-199, ae 

TEICHNER, W. H. Recent studies of OA 
reaction time, Psychol. Bull, 1954, 5h 
128-149, 

Wetts, G. R. The influence of similis 
duration on reaction time. Psychol. 
Monogr., 1913, 15(5, Whole No. 66). i. 

Woopwonrg, R. S., & ScurosnEnc, H. d 
Derimental psychology. New York: Holt, 

954, 


(Received February 17, 1964) 


Journal of Experimental Psycholo, 
1965, Vol. 69, No. 4, 401-407 Er 


SIGNAL UNCERTAINTY AND SLEEP LOSS! 


HAROLD L. WILLIAMS? OMETTA F. KEARNEY, anp ARDIE LUBIN € 
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During a 3-5 day base-line period, 2 days of sleep loss, and 3 days of 
recovery, 52 Ss performed 3 visual vigilance tasks, of 10 min. each rang- 
ing in signal uncertainty from complete redundancy to .84 bit per 


second. 
which increased with sleep loss. 


certainty and sleep loss was significant. 


The major effect of uncertainty was to cause errors of omission 
The interaction between signal un- 


Task duration (of 10 min.) 


caused no impairment during the base-line and recovery phases, but 


during sleep loss, errors of omissio 
There was no signific: 


each task. 
certainty and task duration. 


n rose sharply on the last 3 min. of 
ant interaction between signal un- 
Decrement was considerably greater for 


Ss working alone than for Ss working ina group. Oral temperature had 


no consistent relation to errors o 


During sleep loss, impairment of 
Performance seems to be related to the 
uncertainty of the task, but the nature 
of the relationship is not clear. Wilk- 
inson (1958) suggested that perform- 
ance decrement during sleep depriva- 
tion increased as task predictability 


increased. Thus, "complex" tasks 
involving multiple displays, high in- 
formation load, or a variety of 


responses should maintain a sleep-de- 
Prived operator in an aroused” con- 
dition, In general Wilkinson showed, 
as had several other investigators 

(Katz & Landis, 1935; Robinson & 

obinson, 1922), that “complex 
tasks were not very sensitive to sleep 
loss, 

_A similar explanation has been 
&iven for the results found with com- 
Plex and simple tasks used in studies 
or signal detection. When simple 
i most studies 


‘splays were used, 
e of 


Ound a decline in percentag 
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f omission or to sleep loss. 


signals detected over time (McGrath, 
Harabedia, & Buckner, 1959). In- 
vestigators who used more complex 
tasks found no decrement in signal 
detection (Adams, Stetson, & Humes, 
1961; Broadbent, 1950). 

Preliminary studies in our labora- 
tory failed to confirm the suggestion 
that increased task uncertainty raises 
the efficiency of the sleep-deprived S. 
Performance on a vigilance task 
designed with high signal uncertainty 
was greatly impaired during sleep loss, 
although a redundant task showed no 
deterioration. The present studies 
were designed to confirm these results, 
and to test for interaction between the 
effect of signal uncertainty and the 
effect of sleep loss. The experiments 
also permitted examination of the 
effects of task duration, group testing, 
and reduced body temperature. 


METHOD 


Subjecls.—Fifty-two United States Army 
enlisted volunteers (mean age, 24 yr.; range, 
18-45 yr.) took part in the experiment. In- 
telligence scores obtained by averaging the 
Vocabulary and  Arithmetical Reasoning 
scores of the Army Classification. Battery 
(Montague, Williams, Lubin, & Gieseking, 
1957) ranged from one SD (ec = 20) below the 
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Army mean of 100 to two SDs above; the 
mean was 105. : 

Four Ss were quartered at a time on a 
ward. The first (reception) day was devoted 
to familiarization with the schedule and the 
tests. After 3-5 days of base-line practice on 
a regular sleep schedule, Ss had 64 hr. (2 
nights) of sleep loss followed by 3 days of 
recovery testing. During the sleep-loss phase, 
Ss were kept awake constantly by Os on a 
24-hr. schedule. On base-line and recovery 
days, all Ss went to sleep between 2200 and 
2300 and were awakened at 0600. The tasks 
were given between 1330 and 1500. During 
the remaining hours, Ss were occupied with 
other activities. 

Twenty-four Ss (Group 1) were tested in- 
dividually. To examine the effects of working 
in a group, the remaining 28 (Group 2) were 
tested in groups of four. For this latter 
group, oral temperatures were obtained at 
the beginning of each test session. 

Apparatus and procedure.—The vigilance 
apparatus? consisted of a display panel of five 
lights (red, yellow, green, blue, and white) 
arranged in a Pentagon with sides of 24 in. 
The diameter of each light was 1 in., and the 
center of the pentagon was 60 in. above the 
floor. The S sat at a table at a viewing 
distance of 8 ft. The response key was a 
microswitch on a foam rubber base, fastened 
to the table in a position convenient to .S's 
preferred hand. During group testing ses- 
sions, the four Ss viewed the same display 
from individual booths arranged side by side. 

On the reception day Ss were taught to 
press the response key each time the red light 
came on and to refrain from pressing for all 
otherlights. Correct responses were recorded 
on counters, 

"Three tasks differing in predictability were 
used: a standard task (S), a redundant task 
(R), and an uncertain task (U). Each task 
was programed with a five-channel punched 
tape, and a Western Union tapereader. The 
tape for each task consisted of 600 stimuli 
which ran at a rate of one light per second for 
10 min. Eight lights out of 30 were signals 
(red). The tapes were run in the order S, U, 
R, with approximately 30-sec. rests between 
a S task was modeled closely on the 
Continuous Performance Test (CPT) de- 
Se ibed by Rosvold, Mirsky, Sarason, Bran- 
scribe ] Beck (1956). The same proportion 
sn pra of signals as well as the same 
pared interval were used. The eight 


4 apparatus is a modification of one 
he apparatus I 

n rsoni. 
designed by C. S. Gerson 


red signals out of 30 stimulus lights appeared 
in Positions 6, 9, 11, 14, 18, 23, 27, and 30. 
Each 5 saw this fixed sequence 20 times during 
a test session (10 min.). The same tape was 
used each day. 

The U tape was randomized with r 
to the 5 lights, subject only to the restr’ c 
that 8 lights of each set of 30 were red signal 
lights. A different 10-min. U tape was used 
each day. " 

The R tape consisted of 30 lights in a fixed 
sequence. The 8 red lights were together. 
The S immediately learned to press the key 
cight times in a row after 22 nonred lights had 
been presented. Each S saw the fixed R 
sequence 20 times (10 min.) on each day of 
testing. Thesame R tape was used every day. 

Stimulus uncertainty for the S task was 
initially 0.84 bit per stimulus; it approache 
Zero as learning progressed to asymptote: 
The U task had an average stimulus uncer- 
tainty of 0.84 bit per stimulus throughout 
the sessions. The R task, learned completely 
during the reception day, had a stimulus gA 
certainty of zero on all testing days. The S 
were told that the S task contained a complex, 
but learnable structure, that the U task con- 
tained no identifiable pattern, and that the 
task was completely predictable. 


RESULTS 


Effect of signal uncertainty.—Vigure 
1, based on Group 1, shows that the 
three tasks, S, U, and R are sensitive 
to sleep loss, and that the error scores 
(missed Signals) return to base-line 
levels after one night of recovery sleep- 
Sleep-loss decrement is considerably 
greater for the U task than for the 
other two. 

A rank-order test (Lubin, 1961) 
performed on the base-line means © 
the 3-day and 5-day practice groups 
showed no significant decline in the 
Percentage of errors of omission over 
the base-line days for either Group 
or Group 2. Therefore, it was as- 
sumed that S's performance was 
approximately stationary. An aver- 
age base-line score (B) was compute 
for each S, The effect of the first 
night of sleep loss was estimated from 
dı = D, — B, the percentage of errors 
of omission after the first night of 


ent 
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1 
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: 
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BASELINE SLEEP LOSS RECOVERY 


ance with three levels of signal uncertainty (Individual 


Fic, 1, 2 
the last 3 base-line days were used.) 


„Effect of sleep loss on visual vigil 
esting), (For the 5-day practice group, 


sleep loss minus the average base-line 


TABLE 1 uie E A 
score. Similarly, the effect of 2 nights 


SLEE ¿p Type or TASK à 
- SLerr Loss Axp Type or T4 of sleep loss was estimated by dy 
= — ; = D. — B. 
a (55 Hr. E 
a di (31 Hr.) as ( Table 1 shows average decrement 
‘ask 


i [Variance scores and their variances on the three 

= Ar SPEI E tasks for both Group 1 and Group 2. 
The ż test? on these means showed that 
after 1 night of sleep loss, errors of 
{andra (s) 3 m omission on the U task increased sig- 
Redundo (Ud, | 2958" 3146 Tes nificantly in both groups. After the 
ee is i second night of sleep loss, all three 
tasks showed a significant increase 

over base line in percentage errors of 
Undara (s) z ssi | 312%) $538 — omission. Since the S, U, and R 


Group 1 (Individual Test) 


Group 2 (Group Test) 


necerta E eet 

R ain (U +k “68 3.85 * 
edundane qu) | 52) 262 | .09* | 417 means were correlated, the Hotelling 
enc ^ 


NS NM E ns T? zero-mu test (Rao, 1952, p. 243) 


Note 3 is the percent CN 
avers of Sit = Dic B where Peep loss B isthe for the deviation of a vector of means 
1 Aus ing ase-lin suos 
Derigg’® Percent errors of omission during Hc from zero was performed within each 
x 
wad X.05. r 5 
ag P SS 01, 5 All £ tests in this paper are one-tailed. 


? «.001. 
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group. It confirmed the univariate 
t tests. 

These results implied that the U 
task was more sensitive to sleep loss 
than the other two. The Hotelling T? 
symmetry test (Rao, 1952, p. 241) for 
equality of correlated means showed 
that the average decrement scores for 
the three tests did not differ signifi- 
cantly after 1 night of sleep loss. How- 
ever, after 2 nights of sleep loss, the 
impairment on the U task was sig- 
nificantly greater (p < .01) than on 
the other two tasks. This differential 
effect was found in both Group 1 and 
Group2. Ineach group, the Hotelling 
linear function for the optimal meas- 
urement of sleep-loss impairment gave 
positive weight to U and negative 
weights to the S and R tasks for both 
the d; and d» vectors. The symmetry 
test thus confirmed the greater sensi- 
tivity of the U task to sleep loss. 

To determine if there was an 
interaction between the amount of 
sleep loss and signal uncertainty, dı 
was subtracted from d; for each S on 
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| 2s 4 
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Effect of duration and sleep loss on the Standard (S) task. 
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each task. In both Group 1 and 
Group 2 a Hotelling 7* symmetry test 
on these vectors of difference means 
showed that the increase in errors of 
omission (from D, to D») was signifi- 
cantly greater (p < .05) for the U 
task than for the other two tasks. 

Effect of task duration.—In Fig. 2, 
percentage errors of omission during 
each minute of the S task has been 
plotted against task duration for each 
sleep-loss day and for the base-line 
phase.  Task-duration scores were 
available only for Group 1. Results 
for the recovery phase have not been 
plotted since they were almost exactly 
the same as the base-line scores. The 
U and R tasks (not shown) gave 
approximately the same results as the 
S task. 

After 1 night of sleep loss, there was 
little change on the S task for the first 
7 min. During the last 3 min., errors 
of omission increased rapidly over 
base-line levels. After 2 nights of 
sleep loss, omissions increased rapidly 
after 2 or 3 min. of work. 


o 


, 
pss hours of sleep loss 


P^ «— 3l hours of sleep loss 


<— Baseline 
x 


5 € t€ 8 9:109 
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TABLE 2 
Task DURATION, TYPE OF TASK, AND SLEEP Loss 
di (31 Hr.) d: (55 Hr.) 
s U R s U R 

First 3 min. 

M —.23 1.16 .93 .90 10.36 4.31 

s 2.67 10.39 10.64 7.11 246.41 63.91 

t — 1.76* 1.39 1.65 3.23*** 2.64** 
Last 3 min. 

M 2.92 5.15 1.07 11.26 15.22 7.06 

s 45.84 96.62 19.21 183.47 354.35 114.38 

t 2.11* 2.57** 1.20 4.07*** 3.96*** 3.23*** 

Last minus first 3 min. 

M 5 3.99 44 10.36 4.86 2.75 

s 39:38 61.90 25.29 | 159.56 144.97 67.87 

t 2.46* 2.49* <1 4,02*** 1.98* 1.86* 

Note,—Entries in the table are percent errors of omission based on di = Di — B. 

HE 
* 5 21901. 


Table 2 shows that during the first 

3 min, there was significant decrement 
on the U task after both the first and 
Second sleep-loss nights. During the 
last 3 min., 1 night of sleep loss was 
Sufficient to produce significant dec- 
rement on both the U and S tasks. 
fter 2 nights of sleep loss all 
three tasks showed highly significant 
€crement. , 
The task-duration effect was esti- 
mated for each S by comparing the 
€crement score in the final 3 min. 
With the decrement score in the first 
min. of each task. The bottom 
Section of Table 2 shows that on all 
tasks, the percent errors of omission 
increased from first to last 3 min., and 
at, in general, the task-duration 
effect increased as sleep loss increased. 
€ difference in decrement scores 
etween first and last 3 min. was 
Significant after 1 night of sleep loss 
or Tasks U and S. Task R, however, 


showed a significant duration effect 
only after 2 nights of sleep loss. The 
increase in the task-duration effect 
from 31 hr. to 55 hr. of sleep loss was 
significant only for Task S. 

Effect of group testing —Table 1 
indicates that sleep-loss impairment 
was considerably less for Group 2 
(group testing) than for Group 1. 
The £ tests showed that the effect of 
the social situation was significant for 
the U and R tasks after the first night 
of sleep loss and for all three tasks 
after 2 nights without sleep. A linear 
discriminant function analysis con- 
firmed this finding. 

Effect of oral temperature.—Oral 
temperatures were taken on each S 
tested in group performance just be- 
fore the testing session. The average 
base-line temperature was 97.99? F. 
decreasing to 97.96? F. after 1 night 
of sleep loss, and to 97.81? F. after 2 
nights of sleep loss. These decreases 
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from base line were not significantly 
different from zero. The Pearson 
product-moment correlations of oral 
temperature changes (deviations from 
base-line average oral temperature) 
with the decrement scores after the 
first night of sleep loss were 0.09 for 
Task S, —0.19 for Task U, and —0.37 
for Task R. A negative correlation 
indicated that errors of omission in- 
creased as oral temperature decreased. 
However, after 2 nights of sleep loss 
the correlations were all positive, and 
ranged from 0.08 to 0.29. Therefore, 
there was no consistent relation be- 
tween oral temperature and errors of 
omission during sleep loss. 


DISCUSSION 


The present findings confirm previous 
results obtained in this laboratory: in- 
creasing task uncertainty lowers the 
efficiency of the sleep-deprived S. Are 
these results, then, inconsistent with the 
earlier findings that complex tasks are 
not sensitive to sleep loss? 

Wilkinson (1964) abandoned his 
preliminary guess that increased predict- 
ability potentiates performance decre- 
ment during sleep loss. Instead he 
suggested that motivation may be a key 
factor in the different kinds of results 
obtained. He assumed that tasks vulner- 
able to sleep loss largely lack incentive. 
Interesting, challenging tasks, on the 
other hand, resist the effects of sleep loss. 
Ordinarily, interesting tasks have high 
stimulus-response uncertainty, but un- 
certainty and interest have antagonistic 
effects during sleep loss. Unfortunately, 
no experiment has been done in which 
interest and uncertainty were varied 
independently. 

Lapse hypothesis—The impairment of 
performance during states of sleep dep- 
rivation has been attributed to the 
occurrence of lapses (Warren & Clark, 
1937). Williams, Lubin, and Goodnow 
(1959) end Williams, Granda, Jones, 
and Armington (1962) showed 


a (defined on the basis of the 


that la 


EEG as brief periods of light sleep) were 
associated with errors of omission in 
work-paced vigilance tasks and that 
lapses increased in frequency, duration, 
and depth as sleep loss increased. 

Since greater signal uncertainty causes 
longer S-R intervals (Hyman, 1953), the 
relatively high impairment in perform- 
ance on the U task may have resulted 
from a greater coincidence of lapses with 
critical periods in the protracted S-R 
interval. Assuming that the probability 
of a lapse and the probability of a signal 
are independent, frequency of missed 
signals should be a multiplicative func- 
tion of the two; therefore, one would 
expect, as found here, an ordinal inter- 
action between sleep loss and signal 
uncertainty. Further analysis of the 
data of Williams et al. (1962) revealed 
also that during sleep deprivation the 
frequency of EEG lapses increased as the 
task duration increased. Thus, the m- 
crease in errors of omission with increas- 
ing task duration could be the result o 
an increase in the rate of lapses. Finally 
a beneficial effect of interest on task 
performance may result from the occur- 
rence of a lower lapse rate during the 
performance of interesting tasks. 

Social stimulation has been show" 
previously to offset the effects © 
sleep deprivation (Ax, Fordyce, Loovas: 
Meredith, Pirojnikoff, Shmavonion, & 
Wendalh, 1953; Laties, 1961). The 
results of the present experiment, where 
Ss working in a group showed less decre- 
ment than Ss tested individually, confir! 
these findings. 

There was no relation between change 
in oral temperature and errors of 0m15- 
sion. Thus, the present data provide an 
exception to Kleitman's generalization 
that performance follows body t¢™ 
perature (Kleitman, 1933). 
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S-R COMPATIBILITY AND 


INFORMATION REDUCTION ! 


PAUL M. FITTS axb IRVING BIEDERMAN 


Universit 


S-R compatibility effects were examined in 
tasks (1-bit information conserving, 


and 2 to 1 bit condensing) in comb 
fingers of 1 hand only 


Morin, Forrin, and Archer (1961) 
recently investigated the effects of 
different “perceptual demands" im- 
posed by irrelevant stimulus informa- 
tion in tasks where Ss are required to 
use fewer responses than there are 
stimuli, and concluded that perform- 
ance is essentially independent of 
variations in perceptual demands, 
being primarily a function of trans- 
mitted information. Posner (1962, 
1964) has subsequently proposed a 
taxonomy for such tasks. He proposes 
the term information conservation for 
tasks in which the number of stimulus 
ind response categories correspond, 
ind the term information reduction 
or tasks in which there are fewer re- 
sponse categories than stimulus cate- 
rories. Information reduction may be 
ccomplished by filtering (or gating), 
yhere one or more stimulus dimen- 
ions are irrelevant and may be 
snored, or else by condensing informa- 


1 This research was supported by the Ad- 
inced Research Projects Agency, Depart- 
ent of Defense, and was monitored by the 
ir Force Office of Scientific Research, under 
)ntract No. AF 49(638)-1235 with the 
uman Performance Center, Department of 


ychology. 


y of Michigan 


4 information-processing 
2-bit conserving, 
ination wit! 


tion, and tested for 2 
Itering were accomplished 
‘odes. The other 4 tasks 
When a compatible (2- 
on conserving was more 
ng, notwithstanding the 


tion, where all stimulus dimensions 
are relevant and must be considered. 
Condensing information would appear 
to impose greater perceptual demands 
than filtering. Thus Fitts (1959) has 
predicted that filtering should, in most 
cases, be accomplished more efficiently 
than information conservation, given 
the same stimulus source, whereas 
Posner (1962), examining informa- 
tion-condensing performance in @ 
variety of tasks, has found that the 
latter form of information reduction 
requires more time than does informa- 
tion conservation, given the same 
sequence of stimuli. Morin, Forrim 
and Archer’s (1961) results are an 
exception to these generalizations: 
since condensing as well as filtering 
information (Cond. IV and V 9 
Fig. 1) was found to be faster than 
information conservation (Cond. iu ). 

In examining the Morin, Fortis 
and Archer (1961) experiment, it d 
peared to the present authors tha 
their four-choice, information-conser- 
vation task might be low in S-R com- 
patibility (see Fitts & Seeger, 1953) 
and that compatibility effects might 
thus account for the fact that this was 
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S-R COMPATIBILITY AND INFORMATION REDUCTION 4 


INFORMATION 
GROUP PROCESSING TASK STIMULUS-RESPONSE CODES 
Stimuli 
eo e m BE 
Responses 
I Conservation(I—| Bit) FR 2R 
0-8 Conservation(I—! Bit) FL. ER 
IH Conservation(2-2 Bits) 3-R IR 2R 4R 
0-8 Conservation(2-2 Bits) 2-L I-L cR 2R d 
IV Filtering || (2—1 Bit) I-R -R 2R 2R 
Filtering (2—! Bit) rL I-R I-R 
4 Condensing (2—!Bit) 
vB Condensing (2—IBit) 
S| 


* " 
I- First or forefinger, 
L= Left hand, R= Right hand. 


Fic. 1. Experimental conditions for the eight groups. 


the most difficult of the four tasks 
Which they studied. Accordingly it 
Was decided to try to find a more 
Compatible set of four-finger responses 
Or use with their stimuli (one square, 
ne circle, two squares, or two circles). 

Population stereotypes were deter- 
mined by asking Ss to indicate the 
Our fingers they would prefer to use 
With the four stimuli and how they 
Would form S-R pairs. Out of ap- 
Proximately 100 Ss so queried, nearly 
all selected the middle and forefingers 
of the right and left hands, assigning 
Stimulus form to hands, and stimulus 
number to fingers as shown in Cond. 
NI-B, Fig. 1. “This “two-dimensional’ 
Tesponse set was therefore selected as 
ikely to be more compatible than the 
One-hand response set (Cond. II) 
Used in the previous study. All four 
Information.handling tasks were stud- 
ed, using both one-hand and two- 
and response sets, but it Was pre- 
dicted that S-R compatibility effects 
Should be prominent only for the task 
"fquiring the use of four responses 


2= Second or middle finger, etc. 


(Cond. III and II-B), since result 
from recent studies (see Broadben 
1963; Fitts, 1964; Griew, 1958) ir 
dicate that compatibility effects at 
negligible for two-alternative choice: 
but increase rapidly as the number « 
alternatives is increased, i.e., con 
patibility effects show up in the slo% 
of the function relating reaction tim 
(RT) to information transmitted, nc 
in the intercept. 


METHOD 


Stimulus forms were drawn in black ink c 
white cards, and exposed by the opening of 
double-bladed shutter. Exposure was te 
minated by S’s response, whether correct « 
in error (instead of by correct response onl 
as in the previous study), and RTs we 
measured for the initial response, wheth 
correct or in error (instead of for the correctic 
when an error was made), in order to confor 
to conventions for computing informatio 
transmission rates. Otherwise the meth 
employed by Morin, Forrin, and Arch 
(1961) was used. 

Ten Ss were assigned randomly to each 
eight groups, and each group assigned to o 
of the conditions of Fig. 1. All Ss were test 
first for one block of trials under simple R 
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conditions (Cond. I), as in the previous study, 
and were then given seven blocks of choice 
RT trials, 16 trials per block, on one session, 
followed by eight blocks of 16 trials during 
another session 48 hr. later. 


RESULTS 


Reaction time.—Median RTs were 
computed for each S for each block of 
trials, and group means determined. 
The eight tasks clearly fall into 
two distinct clusters (see Fig. 2). 
Throughout both sessions Cond. II] 
and V required longer RTs than did 
Cond. II and IV. Since there was an 
overlap of only 3 Ss in the tails of the 
two RT distributions out of a total 
of 80 Ss, the significance of this gross 
difference is clear. Data for the two 
sets of conditions and for each session 
were therefore analyzed separately. 

As stated previously, comparisons 
of performance among the four most 
difficult tasks is most relevant for 
assessing S-R compatibility effects. 
Inspection of Fig. 2 reveals that the 
predicted effects are in fact present. 
The shift to a more compatible two- 
hand response set reduced the mean 


— eae 


f Task One Hond Two Honds 


Conserving (2 B1) — 
TA Condensing (2-181) = 4—a | 
a. 


Filtering (2-1 Bil 
(dp \ E [nm 
| ve 9 
| LAY ON 
3" a 

H N 


REACTION TIME IN MSECS 


Pan 


1234567 
DAY 2 


1234567 
DAY 1 
BLOCKS OF 16 TRIALS 


Fic. 2. Group means for the eight psi 
mental conditions, based on individual m 
dians for each block of 16 trials. 


RT for the 2-bit information-conser- 
vation task by about 100 msec. on 
initial trials and reversed the superi- 
ority of the 2 to 1 bit information-con- 
densing task relative to the 2-bit 
information-conserving task. Results 
of an analysis of variance for these 
data (see Table 1) indicate that 


TABLE 1 
ANALYSES OF VARIANCE FOR THE DIFFICULT Tasks ror BOTH SESSIONS 
Day 1 Day 2 
Source 

df MS F df MS F 
Between Ss 39 39 
Info. proc. task (1) 1 7 ns 1 3.9 un 
Response codes (C) 1 885.7 4,51* 1 849.9 4.80 
IS C 1 1395.2 7.11* 1 140.8 s 
Ss within groups (Ss) 36 196.2 36 177.1 
Within Ss 240 280 

** 
Trials (T) 6 203.7 6.23** y 127.2 6.91 
Fon ( 6 20.1 ns 7 22.1 ns 
CHT 6 20.2 ns 7 9.6 us 
IXCXI 6 47.6 "s 7 17.9 us 
Tx Ss 216 32.7 252 18.4 
#p <.05. 


*# p c.01. 
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TABLE 2 


ERROR AND INFORMATION TRANSMISSION DATA FOR THE EIGHT GROUPS OVER Bo 


TH SESSIONS 


H: per Second 


Group Task Response Code | Percent Errors | 2+ Bey ngponse (Bits) 
II 1-bit conserv. 1 hand 3.62 0.81 1.76 
II-B 1-bit conserv. 2 hand 4.33 0.76 1.70 

II 2-bit conserv. 1 hand 9.04 1.56 2.40 

III-B 2-bitZconserv. 2 hand 7.79 1.63 2.76 
IV 2 to 1 bit filter. 1 hand 4.79 0.77 1.70 
IV-B  |2to 1 bit filter. 2 hand 5.00 0.77 1.63 
V 2 to 1 bit condens. 1 hand 7.96 0.69 1.10 
V-B 2 to 1 bit condens. 2 hand 11.75 0.51 0.83 


learning (trials) was statistically sig- 
nificant within each day, that response 
coding effects (two vs. one hand) were 
Significant within each day, and that 
the Task X Response Code inter- 
action was significant on the first 

ut not on the second day. The last 
result indicates that the magnitude of 
compatibility effects decreased with 
training, 

An analysis of variance for the four 
easier tasks indicated that neither the 
Use of one vs. two hands, nor filtering 
Vs. Conserving of information had a 
Significant main effect. However, 
“arning was again highly significant, 
38 was the interaction of Trials X Ex- 
Perimental Conditions. i 
, Errors.—Errors for the eight condi- 
tions (see Table 2) were less frequent 
than in the earlier study by Morin, 
Forrin, and Archer (1961). Mean 
“Tors correlated .87 with mean RTs 
“cross the eight tasks, and the four 
ne with longest RTs all had the 
"IBhest error rates. Analyses of 
Variance for the error scores for the 
eur most difficult conditions and for 
Fs Íour easier conditions indicated 

at within these sets of data differ- 
ja among tasks were not statist- 
fone reliable, However, within the 
follo most difficult tasks error rates 
as Owed a similar interaction pattern 
bi did the RT scores; in the 2 to 1 
‘t information-condensing task more 


errors were made with two hands than 
with one, with the reverse being true 
for the 2-bit information-conserving 
task. The error data thus lend sup- 
port to the findings based on the RT 
data. 

Information-transmission rates.— 
Information-transmission rates (see 
Table 2) for the 2 to 1 bit information- 
condensing task were by far the lowest 
of any of the tasks studied, the aver- 
age value of information transmission 
rate (H,/sec) being barely half that 
for 2 to 1 bit information filtering or 
for 1-bit conservation. 


DIscussIon 


For three of the four conditions that 
constitute a replication of Morin, Forrin, 
and Archer’s (1961) experiment, the 
present results agree fairly closely with 
the original data. However, this is not 
true for Cond. V. In the earlier study 
the information-condensing task, Cond. 
V, gave relatively faster RTs than those 
obtained in our replication, and also gave 
much faster RTs than those obtained for 
the 10 Ss tested under Cond. V-B of our 
experiment. One explanation is that of 
an unusually discrepant sample of Ss in 
the earlier study; this explanation is 
given credence by the fact that Morin, 
Forrin, and Archer (1961) reported that 
this group contained ". . . two highly 
atypical Ss who responded very rapidly 
and transmitted information with near- 
perfect accuracy [p. 94]." The present 
findings indicate a large superiority of 
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the 2 to 1 bit filtering task over both the 
2 to 1 bit condensing and the 2 to 2 bit 
information-conserving tasks. However, 
considering both sessions, the rank order 
of the four replicated tasks is the same as 
in the original study. Only when the 
two-hand response set is used is the rank 
of the two most difficult tasks reversed, 
this effect being statistically significant 
on the first experimental session. 

The present findings therefore support 
the conjecture that the results obtained 
by Morin, Forrin, and Archer (1961) 
were due in part to the choice of a re- 
sponse set which did not permit the 
formation of a compatible S-R ensemble. 
In view of these results, it therefore does 
not appear wise to conclude, as they do, 
that "apparently some minimum amount 
of time is needed to learn new or unusual 
codes but once they are learned, coding 
may become a relatively unimportant 
determinant of the rate of information 
processing [p. 96],” or that “the rate at 
which information can be processed can 
adequately be accounted for in terms of 
formal information measures without 
reference to the perceptual demands of 
the task [p. 95]." Instead it appears 
that performance does depend in part on 
the "perceptual demands" of different 
information-processing tasks, as Morin, 
Forrin, and Archer (1961) originally 
surmised. 

The present results, considered in con- 
junction with earlier data (Alluisi & 
Muller, 1958; Brainard, Irby, Fitts, & 
Alluisi, 1962; Broadbent, 1963; Broad- 
bent & Gregory, 1962; Deininger & Fitts, 
1955; Fitts & Seeger, 1953; Leonard, 
1961), suggest that efficiency in informa- 
tion reduction, as well as in information- 
conserving tasks, is determined in part 
by stimulus-response interactions, or 
S.R compatibility effects. 

The examination of human perform- 
ance in tasks that require selective 
information reduction, such as filtering 
and condensing information, would ap- 
pear to be an important bridge to an 
understanding of more complex percep- 
tual and cognitive processes. As an 
illustration, it appears that the de pen 
of dimensions of the pene an d 

sponse is an important determinant o 
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information-processing performance, as 
has frequently been shown to be the case 
in concept-identification and classifica- 
tion-learning studies. 
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DISCRIMINATION PERFORMANCE AS AFFECTED BY 
PROBLEM DIFFICULTY AND SHOCK FOR EITHER 


THE CORRECT OR INCORRECT RESPONSE ! 


HARRY FOWLER anp GEORGE J. WISCHNER 
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ssess one interpretation of the paradoxical facilitating effect of 
shock for the correct response, 12 groups of 8 hungry rats each received 
visual-discrimination training for food under no-shock, shock-right, 
and shock-wrong conditions and 4 levels of problem difficulty. Difü- 
culty was manipulated by varying the relative. brightnesses of the 
discriminanda. Performance was progressively retarded with increas- 
ing levels of difficulty. However, with the more difficult discrimina- 
tions, performance relative to that of the no-shock Ss was facilitated by 
the shock-right condition and more so by the shock-wrong condition. 
It is suggested that shock may have both an avoidance and cue function, 
the latter presumably providing the basis for the finding, obtained for 
the Ist time in our laboratory, that shock for the correct response 


facilitates performance. 


The present study was designed to 
assess the effect of shock for either the 
Correct or incorrect response on 
Performance in visual-discrimination 
tasks varying in level of difficulty. 
The rational for investigating the diffi- 
Culty-of-discrimination variable stems 
Tom one interpretation of the para- 
doxical facilitating effect of shock for 
the correct response examined by 
Freeburne and Taylor (1952). These 
Investigators, suggesting that shock, 
Associated only with the correct re- 
Sponse, may serve as a differential cue 
and hence as a secondary reinforcer, 
trained animals on a task involving 
Shock for both correct and incorrect 
responses, a condition precluding any 
Cue effect of the shock. The finding 
hat this shock-both group performed 
Significantly better than a no-shock 
Sroup led Freeburne and Taylor to 
Gp This study was supported in part by 
pant No. G-14312 from the National Science 

Pundation and Grant No. MH-08482 from 
te National Institute of Mental Health, 
lene States Public Health Service. = 
i acknowledgment is made by He aom 
assistar fernstein aud N. Ketterle or : 
ance in the conduct of this experiment. 


reject the cue-function interpretation 
of the facilitating effect of shock for 
the correct response and to interpret 
their data as supporting a general 
sensitizing function of shock as orig- 
inally proposed by Muenzinger (1934). 
Prince (1956) questioned Freeburne 
and Taylor's (1952) results on certain 
methodological grounds, however, and 
failed in his experiment to obtain any 
difference between shock-both and 
no-shock groups. 

In a very recent study of the effect 
of various combinations of food and 
shock for either the correct or in- 
correct response, Wischner, Hall, and 
Fowler (1964), also obtained no 
difference between shock-both and 
no-shock groups. More important 
was the finding that a shock-right 
group, a control absent in both the 
Prince (1956) and Freeburne and 
Taylor (1952) studies, performed sig- 
nificantly poorer than the no-shock 
group. This failure to obtain a 
facilitating effect of shock for the 
correct response is in accord with 
our previous findings, indicating that 
shock for the correct response serves 
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only to retard performance (Wisch- 
ner, 1947), the extent of this retar- 
dation being greater with increasing 
intensities of shock (Wischner, Fowler, 
& Kushnick, 1963). . 

The present investigation examines 
further the possibility that the facili- 
tating effect of shock for the correct 
response reported by Muenzinger and 
his colleagues (Muenzinger, 1934; 
Muenzinger, Bernstone, & Richards, 
1938) may relate to a possible cue 
function of the shock. It is proposed 
that such an effect might be obtained 
with relatively difficult discrimina- 
tions in which the alternatives, in- 
cluding the discriminative stimuli, are 
highly similar. Under these condi- 
tions, shock for either the correct 
or incorrect response may serve to 
heighten the discriminability of the 
alternatives, thereby delimiting any 
generalization of secondary reinforce- 
ment from the correct (food-rein- 
forced) goal arm to the incorrect arm, 
and thus reducing errors. 


METHOD 


Subjects.—The Ss for the experiment were 
96 naive, male albino rats of the Sprague- 
Dawley strain, about 100 days old at the start 
of the experiment. The Ss were caged in- 
dividually in the experimental room under 
controlled temperature and an artificially 
illuminated day-night cycle, 
Apparatus.—The apparatus was an en- 
closed T maze with a stem 26 in. long, and 
each arm, as measured from the center of the 
choice point, 36 in. long. The height of each 
section of the maze was uniformly 43 in. and 
the width 3} in. The initial 8-in. section of 
the stem was separated from the remaining 
portion by a guillotine door and served as the 
start compartment. Guillotine doors were 
also positioned near the end of each arm, to 
provide 12-in. goal compartments, and at the 
entrance to each arm, 3 in. from the center 
of the choice point, to prevent correction. 
Each maze section was constructed of two 
L-shaped strips of galvanized sheet metal, an 
exterior supporting shell of wood, painted 
flat black, and a transparent Plexiglas Ne 
One L-shaped strip of sheet metal served as 


one wall and half of the floor of the maze 
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section; the other L served as the other half 
of the floor and the other wall. Together, the 
two L-shaped strips provided two, 14-in. 
floor surfaces which were separated by a 
í-in. gap. í 

Each goal compartment had a clear Plexi- 
glas food well (1 in. in diameter X 4 in. deep) 
and an end plate or wall made of frosted 
Plexiglas. The frosted, goal-box end plates 
were visible from the choice point and served 
as the discriminative stimuli when each was 
differentially illuminated by a frosted 10-w. 
bulb located directly behind the end plate in 
a wooden enclosure painted a flat black. = 

To effect different levels of problem diffi- 
culty, the brightnesses of the two goal-box 
end plates were varied by operating one of the 
goal-box bulbs at 120 v. and the other at 
either 96, 87, 78, or 63 v. These four sets of 
voltages for the dim and brightly illuminated 
goal-box end plates provided footcandle 
ratios of .50, .37, .25, and .12, respectively. 
Hereafter, reference will be made to these four 
sets of differential goal-box brightnesses as the 
Difficult (D), Medium Difficult (MD), Me- 
dium (M), and Medium Easy (ME) problem 
conditions, respectively. Additional illumina- 
tion, exterior to the maze, was provided by 
two 10-w. bulbs (120-v. source), one lopan 
near the end of each arm on the stem side © 
the maze. 

A matched-impedance shock system con- 
sisting of a 60-cps ac source and a series 
resistance of .3 megohms was used to den 
shock to S in either maze arm. In this shoc t 
system, the two L-shaped strips of shee 
metal forming the sides and floor of the maze 
were connected across the output of the trans 
former; thus, S received shock when it made 
contact with both halves of the sheet-met? 
floor. Because of the narrowness of the maz 
alley, S could not avoid shock by running 
along only one half of the floor surface. em 

Shock was delivered to S when it nec 
rupted in either arm, an infrared photoelectry 
beam crossing the arm at a point midwa) 
between its two guillotine doors. A p 
priming feature of the shock circuit preven by 
S from receiving more than one shock ing i 
one trial, even though it was possible for - Pon 
interrupt a photoelectric beam several time 
during a trial. When the shock was delivers 
its duration was held constant at .2 sec- 3 
metered through a Hunter timing relay: x 
its intensity was set at 60 v., as measur! 
across the output of the transformer. dure 

Procedure.—The experimental proce! ie 
included both pretraining and training phas A 
One week prior to the pretraining phase: 


Peer he 
were started, and then maintained for t 
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duration of the experiment, on a daily diet 
of 12 gm. Purina lab checkers, with water 
available ad libitum. During this week each 
S was handled daily for about 3-4 min. 
For both the pretraining and training trials, 
Ss were kept in individual compartments of a 
hardward cloth detention cage. Prior to their 
daily run in the maze, which took place during 
the dark phase of the artificial day-night cycle, 
all Ss received approximately 5 min. adapta- 
tion to the conditions of illumination of the 
room as provided by the two exterior 10-w. 
apparatus bulbs. 
Pretraining was administered to habituate 
S to the apparatus and to reduce possible 
Position and brightness preferences. Since 
the latter would presumably relate to the 
different brightnesses of the discriminative 
visual stimuli, the 96 Ss were randomly as- 
Signed to four equally numbered groups, each 
of which was run, during both pretraining and 
training, under one of the four different prob- 
lem conditions. 
For pretraining, each S received a total of 
16 forced, reinforced trials, administered 4 per 
day and randomly distributed with the re- 
Striction that half of the trials were to the left 
and half to the right, with half of each of these 
Sets being to a bright goal and half to a dim 
Soal. Forcing was accomplished by lowering 
ne of the guillotine doors at the choice point. 
e reinforcement provided on each trial 
Consisted of p, J. Noyes Formula A rat pellets 
mm., 45 mg.). On the first of the 4 pres 
training days, the food pellets were liberally 
Spread throughout the goal box; on subse- 
Quent days, the number and locus of pellets 
Were systematically reduced until, on the lank 
ay, two pellets were given only in the goar 
box food cup. With each forced trial, S was 
Permitted to run to the available goal where 
Was detained until all the pellets were 
Consumed or until 5 min. had elapsed. ‘Then, 
Was removed from the maze and repre 
© carrying cage to await its next trial. 
k = average, the interval between successive 
e Within a day was about 15 min., the time 
“quired to run a block of Ss. 7 
er ttining began on the day zm oe 
ti timation of pretraining and consisted © 
rely of free-choice trials. For the bir 
«ase, the 24 Ss of each of the four problem. 
difficulty conditi vere randomly assigned 
to th onditions we! k subgroups: 
Tee equall nbered shoc 
shoc € equally nun SW), and 
n Cl-right (SR), shock-wrong (S d both 
shyshock (NS). Shock-right 5s Bo Masta 
resp, and food reinforcement for a sponse; 
s om but neither for an ingore ge 
fo, Ck-wrong Ss received food reintorc 


Correct response and shock for an 1n- 
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correct response; the control, no-shock Ss 
received only food for a correct response. 

All Ss received four trials per day for the 
first 6 days of discrimination training and 
eight trials per day thereafter. During these 
trials, food reinforcement, consisting of two 
pellets, could be obtained only in the bright 
goal, the right-left position of which was 
varied throughout training in accordance with 
a predetermined random schedule. On any 
training trial, detention time in either the 
correct or incorrect goal was approximately 
10 sec., and the interval between successive 
trials within a day about 15 min., again the 
time required to run a block of Ss. Dis- 
crimination training was continued until each 
S met a criterion of 15 correct responses out 
of a total of 16 free choices with the last 8 
being correct, or until a total of 400 training 
trials had been administered. 


RESULTS AND DISCUSSION 


Group mean errors in blocks of 20 
trials are presented in Fig. 1. The 
separate panels of this figure provide 
the mean error scores for the SR, SW, 
and NS groups within each of the 
problem-difficulty levels. As shown, 
mean errors decreased over the course 
of training for all groups except the 
NS group of the difficult problem. 
Overall, the rate of error reduction 
tended to be progressively slower, the 
more difficult the discrimination task. 
More important is the fact that, 
within each difficulty level, both the 
SR and SW Ss showed more rapid 
elimination of errors than did the 
control, NS Ss. 

Because of the generally orderly 
progression of the performance curves 
for the three shock groups within each 
difficulty level, only group mean errors 
for the entire 400 trials were subjected 
to statistical analysis. These data are 
presented for the three shock groups in 
Fig. 2 as a function of task difficulty. 
Included in this figure for the “E” or 
easy problem are data from a previous 
study (Wischner et al., 1963). The 
significance of these additional data, 
presented for comparison purposes, 
will be considered below. 
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and no-shock 


The results of a 3 X 4 factorial 
analysis of variance of the data pre- 
sented in Fig. 2 for the present study 
showed that group mean total errors 
were positively related to level of task 
difficulty; both the differences am 
the several difficulty levels and the 
linear trend in the means for these 
levels were highly reliable; F (3, 84) 
= 37.2, b < .001 and 'F (i, 84) 
= 1114, p < .001, respectively, Re. 
sidual trends after extraction of the 
linear component were all nonsignifi- 
cant, F « 1. The overall differences 
among the three shock conditiong were 
also highly reliable, F (2, 84) = 16.6, 
p < .001 and, as indicated by a non- 
significant Shock X Difficulty Level 
interaction, F < 1, related generally 
to the range of difficulty levels studied. 
In addition, separate comparisons, 
based on the overall error term, 
showed that the difference between 
any pair of shock groups was reliable: 


ong 


» per 20 trial blocks, for the shock-right (SR), shock-wrong (SW), 
(NS) groups within each conditior 


n of problem difficulty. 


NS vs. SR, F (1,84) = 114,5 < p 
NS vs. SW, F (1,84) = 32.7, p < EON 
SR vs. SW, F (1,84) = 5.5, 5 < .025. 


In contrast to the data of the quod 
investigation, those presented in Fig. : 
for the Wischner et al. (1963) study show 
that the performance of the SR group 
was retarded relative to that of the NS 
group. These additional data are for 
comparable groups of Ss run in „the 
same apparatus under training conditions 
which, for all purposes, were identical to 
those employed in the present study, 
except for the fact that the problem ame 
ployed was a light-dark discrimination 
rather than one of the present set O 
bright-dim discriminations. For this 
reason, these additional data have been 
Presented in Fig. 2 as being representa 
tive of the effects of shock for either the 
correct or incorrect response on pétiorm 
ance in an "easy" discrimination task- 
In connection with these data, it shoul 
be noted that other data from the 
Wischner et al. study, not included 1” 
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o:WISCHNER, et al., 1963 
180 — «: PRESENT STUDY 


ME M Lr D 
LEVEL OF DIFFICULTY 


Fic. 2, Mean errors over the entire 400 
training trials for the SR, SW, and NS groups 
as a function of level of problem difficulty. 
The data presented for the "E" or easy 
Problem level are for comparable groups of 
Ss run in a previous study [Wischner et al., 
19637.) 


Fig. 2, showed an even greater retarda- 
tion of performance for SR animals when 
either a forced-choice training procedure 
Or a higher intensity of shock was em- 
Ployed in their easy problem condition. 
In total, the data presented in Fig. 2 
Suggest that, within the context of dis- 
Crimination learning, shock may serve, at 
least, two functions. Through its aver- 
Sive property, shock may produce avoid- 
ance of the stimuli with which it Is 
associated, thereby increasing errors for 
Ss and reducing them for SW Ss. 
Through its cue property, however, shock 
Or either the correct or incorrect response 
may increase the discriminability of the 
goal arms and thereby facilitate perform- 
ance. This cue effect of shock should be 
Particularly apparent when the dis- 
Criminative stimuli are highly similar, as 
in the case of the difficult discriminations 
*mployed in the present study. With 
this arrangement, the secondary reim- 
Orcing effects of the cues present 1n the 
Correct goal arm may generalize to those 
ìn the incorrect arm, to impede formation 
9f the discrimination. But, when shock 
55 consistently administered for one re- 
SPonse, both the shock experience, itself, 
and any fear elicited by it may serve to 
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make the two goal arms more distinctive, 
thereby reducing the generalization of 
secondary reinforcement from the correct 
to the incorrect goal arm and thus re- 
ducing errors. In contrast, when the dis- 
criminative stimuli are quite dissimilar, 
as in the simple or easy discrimination, 
the cue effect of shock should be rela- 
tively small, and in the case of the SR 
condition, possibly be offset or obscured 
by the shock's aversive or avoidance- 
producing property. 

In view of the present results, and also 
the fact that Muenzinger and his col- 
leagues consistently obtained facilitation 
with their SR conditions, certain aspects 
of Muenzinger's (1934) general procedure 
are particularly noteworthy. Muen- 
zinger always employed a simple black- 
white discrimination, but his animals 
were required, following a correct choice, 
to run from the positive discriminative 
stimulus into a gray goal where food 
reinforcement was obtained. This train- 
ing procedure would seem to enhance 
the "difficulty" of the discrimination, in 
so far as it promotes a temporal dissocia- 
tion of the positive discriminative stim- 
ulus and the food reinforcement, and also 
affords a greater similarity between the 
gray goal-box cues associated with food 
and those present in the incorrect arm. 
Under these conditions, then, as in the 
present study, the SR condition could be 
expected to facilitate performance, since 
the shock experience, itself, and any fear 
elicited by it, would serve both to 
mediate the reinforcing effect of food and 
to minimize any generalized secondary 
reinforcing effects of the cues present in 
the incorrect arm. 

Whether the above interpretation may 
fully account for Muenzinger’s results is 
obviously a matter for additional study 
of the variables relevant to the problem- 
difficulty dimension. It should be ap- 
parent that, as related to the rate of 
acquisition of a discrimination, the di- 
mension of problem difficulty may be 
manipulated by varying the types of 
factors suggested to be operative in 
Muenzinger's situation or by varying the 
similarity of the discriminanda, as in the 
present study. Aside from these con- 
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siderations, however, the findings of the 
present study seem to be of significance 
for general research in the area. Church 
(1963) has commented in his recent 
review of the varied effects of punishment 
that “an important contribution would 
be made by the identification of a pa- 
rameter that would result in facilitation 
by punishment at some values and in- 
hibition by punishment at other values 
[p. 388.]" The results of the present 
study indicate that the dimension of 
problem difficulty is such a parameter; its 
isolation now makes possible an assess- 
ment of the degree to which other vari- 
ables may contribute either to the 
facilitating or disrupting effect of shock 
for the correct response. 
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THE REINFORCEMENT RELATION AS A FUNCTION 


OF INSTRUMENTAL RESPONSE BASE RATE? 


ROBERT W. SCHAEFFER ? 


University of Missourt 


This experiment investigated the relationship between base rate (i.e., 
operant level, or precontingent rate) of the instrumental response and 
asymptotic reinforced responding. Running was used to reinforce an 
instrumental response of licking. Variations in the base rate of the 
instrumental licking response were produced by varying the contents of 
the drinking tube, while base rate of the reinforcing running response 
was held constant. The effects of variations in the base rate of the 
instrumental response on the reinforcement relation were examined for 
several FR reinforcement schedules and for extinction, both with respect 
to asymptotic reinforced licking, and with respect to the reinforced 


increments in contingent licking. The results indicated that asymp- 
totic reinforced instrumental r 


the instrumental response, and th 


tingent instrumental respon 
the instrumental response an 


, Although numerous studies have 
Investigated the changes associated 
With acquisition and extinction of the 
instrumental response under a variety 
of reinforcement schedules, the possi- 
ility that the base rate (i.e., operant 
level, or precontingent rate) of the 
instrumental response might itself 
influence the outcome of the reinforce- 
ment relation has been largely ignored. 

eglect of the instrumental response 
ase line as a potential contributor to 
the reinforcement relation is undoubt- 
edly related to the general theoretical 
Conception that some events are, and 
Some events are not reinforcers (i.e., 
Some responses may be used only as 
instrumental responses, while other 
TéSponses may be used only as rein- 
°rcing responses) Within this con- 
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esponding varied directly with base rate of 
at the reinforced increments in con- 
ding were determined both by base rate of 
d the reinforcing response. 


text, reinforcing responses have been 
treated as absolute, categorical, and 
transituational events (cf. Meehl, 
1950; Miller & Dollard, 1941) which 
have well-established distributional 
properties and which, when made con- 
tingent upon a given instrumental 
response, actively alter the distribu- 
tional properties of the instrumental 
response. Instrumental responses, on 
the other hand, have been treated 
solely as passive responses which, al- 
though they are affected by the rein- 
forcing response, have no effect upon 
the reinforcing response. Recently, 
however, Premack (1963) obtained 
evidence suggesting that asymptotic 
reinforced response rate varies directly 
with the base rate of the instrumental 
response. The purpose of the present 
experiment was to inquire further into 
this relation. 

An adequate test of this problem 
requires a procedure under which an 
extensive range of the independent 
variable (i.e., instrumental response 
base lines) may be produced. Differ- 
ential weighting of the bar, a tradi- 
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tional method of varying the base rate 
of bar pressing, is procedurally unsat- 
isfactory because of the very limited 
range of responding that may be pro- 
duced in this manner. A satisfactory 
method for investigating this problem 
was provided, however, by a reinforce- 
ment situation originally suggested by 
Premack (1962). In the Present ex- 
periment rats were placed into an 
activity wheel into which a drinking 
tube extended. The wheel was locked 
and Ss were required to lick in order 
to have the opportunity to run. The 
base rates of licking were manipulated 
by varying the contents of the drink- 
ing tube, while base rate of running, 
the reinforcing response, was held 
constant. This arrangement proved 
ideal for testing the problem; the pre- 
contingent probability of running was 
greater than that of base-line drinking 
for all tube contents, thus insuring 
reinforcement in the contingent situa- 
tion, while the probability of base-line 
drinking varied substantially as a 
function of tube content, thus provid- 
ing the desired wide range of the 
independent variable. 


METHOD 
Experimental Design 


The Ss were first given a 
adaptation, then b, 
taken of running al 
and running and 
The Ss were then 


tested on the appropriate tube contents for 5 
days. Finally, the main experimenta] design, 
involving a series of paired base-rate sessions 
preceding each FR schedule, was instituted, 


Subjects 


The Ss were 16 female Sprague-Dawley 
rats, about 100 days old. They were indi- 
vidually housed, and maintained on ad-lib 
Purina mash and water. 
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Apparatus 


Four modified Wahmann activity wheels 
that have been described in detail chew Gr 
(Premack & Schaeffer, 1962) were used nd 
wheel adaptation. The apparatus used d » 
experiment proper has also been describe e. 
detailelsewhere (Premack, Schaeffer, &Hundt, 
1964; Schaeffer & Premack, 1961). Briefly, 
this device consisted of a Wahmann activity 
wheel and drum assembly mounted in e" 
insulated ice box hull, and a retractable vx ah 
ing tube assembly. This test cham is 
permitted successive and simultaneous aes 
ing of running and/or drinking, and in 
Programing of contingencies between drinki 5 
and running. Each one-quarter wheel aie 
and each lick were recorded on electric coun 
ers and an Esterline-Angus recorder. 


Procedure 


Wheel adaptation.—The Ss were given is 
days of wheel adaptation in the: dots 
wheels, to permit running to stabilize s 
mack & Schaeffer, 1962, 1963). During d a 
Period Ss were run for a 1-hr. A.M. an all 
l-hr. P.M. session daily; thereafter, for the 
conditions they were run twice daily, In 
A.M. and the P.M., for a 10-min. session. -— 

Base rates and breconditioning.—Base rà n 
were obtained, in the following order, " 
running alone, water drinking alaner te 
running and water drinking together. FR 5 
all Ss were conditioned for 5 days on 1 for 
to lick the water tube to free the whee ; o 
10 sec. The Ss were then given 1 can 
extinction, Primarily to extinguish nonlick 
behaviors (Premack, 1962). vided 

The Ss were then matched and be 
into four equal groups. Each group recei nts 
Only one of four different tube cnc 
throughout the experiment. The tube 4 % 
tents were: empty tube (MT), tap water, 
Sucrose by weight, and 32% sucre. e 
Weight. Matching of the Ss was pea pre- 

Y using the preceding base-line an ro 
conditioning measures. Mean number oe 
Wheel turns averaged over the last six sess an 
of the base-line period in which running * 
Water licking were simultaneously d. for 
to Ss were: 454.8, 429.6, 502.4, and 442.5, ec- 

roups MT, water, 4%, and 32%, ree cien 
tively, Mean number of instrumental ing 
averaged over the last two precondition L 
Sessions were: 271.2, 264.7, 253.2, and ec- 
for Groups MT, water, 4%, and 32%, Ms h 
tively, Analyses of variance made on vel: 
Sets of data failed to approach the 10% p 
thus, the groups did not differ either in 
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rate of running or in their general condition- 
ability for the lick-to-run contingency. 

Conditioning series.—After being assigned 
to one of the four main groups, each S was 
Presented with the appropriate tube content 
for 5 days. These sessions provided base-rate 
measures of drinking for each of the tube 
contents. Thereafter, all four groups were 
subjected to the main experimental conditions 
which are summarized in Table 1. As may be 
Seen in Table 1, the experiment consisted of 
three conditioning series: an FR 5, an FR 41, 
à return to FR 5, and finally, extinction. In 
each conditioning series, completing the FR 
freed the wheel for 10 sec. Preceding each 
Conditioning series were a number of paired- 
response base-line sessions, i.e., sessions in 
which both the wheel and the tube were 
simultaneously available so that S could freely 
Choose between them. The purpose of these 
Paired base-line sessions was twofold: (a) they 
Provided a recurrent check on the equality of 
base-rate running among the four groups, and 
(0) they provided a recurrent measure of base- 
Tate licking from which increments in in- 
Strumental licking in the contingent situation 
Could be evaluated. 

With the exception of extinction, each 
Stage of the experiment was carried to asymp- 
tote. Asymptote was considered attained 
When the submeans of licking in the first three 
and the last three sessions of six consecutive 
Sessions differed by less than 10% from the 
Brand mean of the six sessions. In addition, 
analyses of variance of licking in the A.M. and 
P.M. sessions of the last 3 days of all conditions 
Preceding extinction indicated that there were 


TABLE 1 


SEQUENCE oF EXPERIMENTAL CONDITIONS 
AND NUMBER OF SESSIONS 
PER CONDITION 


Number of 
; ions 
Conditions Sess 


First paired base line (i.e., running 
and drinking simultaneously and 16 
Teely available to S) 


FR 5, lick-to-run contingency is 
Second paired base line 20 
FR 41, lick-to-run contingency 20 
Third Paired base line n 
5 replication, lick-to-run 2 
Contingency E 
Extinction 
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no significant Within-groups differences in 
licking between sessions. All of the data 
presented in this report are based entirely 
upon asymptotic responding in the last six 
sessions of each experimental condition. [i 


RESULTS 


Contingent licking.—Asymptotic 
reinforced instrumental licking as a 
function of base-line licking in the 
first paired base-line condition is 
shown in log-log coordinates in the 
lower panel of Fig. 1. The typical 
effects of reinforcement are seen 
clearly in the parameter, FR require- 
ment, in this figure; licking for each 
group in the FR 41 exceeded licking 
for that group in either FR 5, and ex- 
tinction produced higher licking rates 
for each group than any FR. Both 
contingent and extinction lick rate 
appear to be increasing monotonic 
functions of base-rate licking. Anal- 
yses of variance (Treatments X Ses- 
sions X Ss) of total licking in the last 
six sessions indicated that the trend 
seen in the licking curves was signifi- 
cant at less than the .005 level. The 
F values obtained for the FR 5, the 
FR 41, the FR 5 replication, and 
extinction, respectively, were 13.07, 
20.50, 15.45, and 39.16. In all cases, 
df = 3/12. No significant sessions or 
interaction effects were found in any 
analysis. 

Although the results are not shown 
here, replotting each of the licking 
curves seen in Fig. 1 as a function of 
base rate of licking in the immediately 
preceding paired base-line condition 
did not alter the increasing monotonic 
relation seen in Fig. 1. 

Reinforced increments in licking — 
Although the effects of reinforcement 
have traditionally been evaluated by 
examination of asymptotic reinforced 
instrumental response rate, a more 
suitable method, when base-line meas- 
ures are employed, is to examine the 
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reinforced increment in contingent re- 
sponding (i.e., asymptotic reinforced 
instrumental response rate minus the 
baserate of theinstrumentalresponse). 
As was shown in Table 1, paired re- 
sponse base-line sessions preceded 
each FR. Although the original base 
rates of the reinforcing response, 
running, were recovered in the subse- 
quent paired base-line sessions, the 
original base rates of the instrumental 
response, licking, were not. By 
evaluating the reinforced increments 
in contingent licking (i.e., asymptotic 
reinforced licking minus base-rate 
licking in the paired base-line condi- 
tion immediately preceding the rein- 
forcement condition), these changes 
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Fic. 1. Mean instrumental licks per 
session as a function of mean licks per session 
in the first paired base-line condition. (The 
upper panel shows mean contingent running 
per session in each FR.) 
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in base-line licking were taken into 
account. 

The reinforced increments in in- 
strumental licking as a function of 
licking in the paired base-line condi- 
tion immediately preceding each FR 
and the extinction series are shown in 
log-log coordinates in Fig. 2. Each 
point in the curves represents the 
mean of the differences between in- 
strumental licking and base-line lick- 
ing in the paired base-line seesion 
which immediately preceded each FR 
series and the extinction condition. 
As can be seen, the magnitude of rein- 
forced increments in instrumental 
licking increases with base rate of lick- 
ing up to a point, after which they 
either level off or show a slight decline. 
More data points are needed from 
base lines higher than those of the 
present study to determine if the 
function is actually monotonic up tO 
asymptote, or whether the relation 19 
actually curvilinear., Separate anal- 
yses of variance of the curves of least 
and greatest slopes, the FR 5 replica- 
tion and extinction cases, respectively, 
indicated that the trend seen in the 
former was not significant, F (3, 1 
7144, p < .20, whereas that for €x- 
tinction was significant, F (3, 12) 
= 6.54, p < 01. 

Of further interest was the finding 
that replotting the reinforced incre- 
ments in licking in each contingency 
and extinction as a function of base- 
rate licking in the first paired base-line 
condition did not produce curves 
substantially different from those see" 
in Fig. 2. Although the absolute 
magnitude of the increments foun 
with this method differed from those 
seen in Fig. 2, the shape of the func- 
tions obtained did not. 

Contingent running—The mea? 
rates of the reinforcing running Y€* 
sponse in each FR contingency °° 
Shown in the upper panel of Fig. 
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Fic, 2. Reinforced increments in instrumental licking as a function of mean licks per 
session in the paired base-line condition immediately preceding each contingency. (The 
subscripts 1, 2, and 3 by each group on the x axis refer to the first, second, and third paired 
base-line conditions, respectively. Thus, the FR 5 curve 1s plotted as a function of licking in 

: 1 curve is plotted as a function of licking in the second 


the first paired base line, the FR 4 ipm 
paired Ee line, and the FR 5 replication and extinction curves are plotted as a function of 
, 8 


licking in the third paired base line.) 


As can be seen, there w did these differences in contingent 
difference between the groups on COn- running approach statistical signifi- 
tingent running associated with all cance, F (3,12) = 3.28, p < .10. 
three FR conditions, with contingent The consistent differences between 
: : ath i 3 tingent running might 
runnin; ang directly with instru- groups on con 
varying be due to differences 1n amount run 


men Te Mean number of runs i ; 
per eae e empty tube, water, per opportunity to run; or, to differ- 
32%, and 4% groups respectively, for ences in momentary rate of running; 
the FR 5 ad FR 5 replication, re- or, to differences in number of op- 
Spectively, were 269.9, 276.3, 278.9, portunities to run. In order to 
280.6; and 268.9, 291.6, 328.0, and establish the locus of these differences, 
336.9. Mean number of runs per Esterline-Angus records from the last 
Session in the FR 41 were 76.7, 188.4, two sessions of the FR 5 and FR 41 
268.9, and 425.7. Only in the FR 41 were examined. The number of trials 


as a numerical 


O 
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on which the FR was completed and 
followed by running, and the number 
of runs per trial were tabulated. 
Analyses of variance of mean runs per 
trial in the first 5, and last 5 min. 
of these sessions indicated that the 
groups did not differ in amount run 
per opportunity torun. Furthermore, 
the groups did not differ on momen- 
tary rate of running; over 80% of all 
completed wheel turns occupied 1 sec. 
or less. The groups did differ, how- 
ever, on number of opportunities to 
run. Mean number of trials on which 
the FR was completed and followed 
by running for the empty tube, water, 
32%, and 4% groups, respectively, 
for the FR 5 and FR 41, respectively, 
were 10.7, 11.1, 12.4, 12.7; and 2.9, 
6.7, 11.9, 20.2. Of interest was the 
fact that number of opportunities to 
run varied directly with instrumental 
licking, indicating that in the con- 
tingent case the occurrence of the 
reinforcing response was itself par- 
tially dependent upon instrumental 
response occurrence, 
In summary, asymptotic reinforced 
instrumental licking varied directly 
with base-rate licking. Reinforced 
increments in instrumental licking 
(reinforced licking minus base-line 
licking) appeared to be possibly a non- 
monotonic function of base-line lick- 
ing, with intermediate base rates 
resulting in the maximum increment, 
and smaller increments associated 
with both lesser and greater base rates 
of licking. The groups did not differ 
on base rates of running; the original 
base rates of running were recovered 
in the subsequent paired. base-line 
sessions. The groups did differ, how- 
ever, on rate of contingent running in 
all three FR cases, although the differ- 
enigs approached statistical signifi- 
cance only in the FR 41. Contingent 
running was found to vary directly 
with instrumental licking, and was 
i: ion of the number of 
strictly a functio 
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opportunities to run, there being no 
differences between the groups in 
either momentary rate of running or 
amount run per opportunity to run. 


DiscussioN 


The contention (Premack, 1963) that 
variations in the precontingent rate of 
the instrumental response influence the 
outcome of the reinforcement relation is 
borne out by the results of the present 
study. The effects of variations in 
instrumental response base rate are re- 
flected not only in asymptotic reinforced 
responding, but in the magnitude of 
the reinforcement increment, and in the 
number of times the reinforcing response 
may occur in the contingent situation. 

The stability of the relation between 
base-rate instrumental responding and 
reinforced responding is attested by the 
fact that it was found to hold for several 
FRs and for extinction. The possible 
generality of this finding is suggested by 
apparently comparable data obtained in 
situations differing markedly from that 
employed in the present study. Premack 
(1963) found a comparable relationship 
in monkeys, with preferred and non- 
preferred manipulation responses as the 
reinforcing and instrumental responses, 
respectively. Asdourian (1962) has also 
reported what appears to be a com- 
parable effect; septal stimulation in rats 
failed to reinforce licking water, but did 
reinforce licking a sucrose solution. With 
respect to the results of the present study, 

sdourian's results may be attributable 
to the fact that the base rate of drinking 
a sucrose solution is considerably greater 
than that for drinking water. The re- 
sults of both preceding studies, like those 
of the present study, indicate that the 
reinforcement increment is not deter- 
mined solely by the reinforcer, but rather, 
is determined both by the reinforcer 


and the base rate of the instrumental 
response, 
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DELAYED COLD-INDUCED VASODILATATION 
AND BEHAVIOR! 


WARREN H. TEICHNER 


University of Massachusetts 


The latency of cold-induced vasodilatation of th 


a risk of shock. 


When the body is kept warm and 
the hand is immersed in cold air or 
water, there is an immediate vasocon- 
striction in the fingers which may be 
observed as a rapid drop of surface 
temperature of the fingers. [n 1929, 
Lewis observed that with continued 
immersion, a reflexive vasodilatation 
occurs. That is, after some time, the 
veins dilate bringing warm blood to 
the surface. This may be observed 
as a sudden reversal of the decreasing 
finger-temperature curve. Over a 
period of prolonged immersion, there 
may occur a phasic vasoconstriction- 
vasodilatation in the finger which is 
exhibited as a sinusoidal-like finger- 
temperature curve, This phenom- 


are large differences. 
show the classical req 
others show a single vasodilatation to 


l'This research was Supported by hes 
National Institute of Mental Health Grant 
} M4201. 
ES Dahlquist, Terrence Walsh, and 
Edward Youngling acted as research assist- 
ants, 


some upper limit of warming after 
which their hand cannot be cooled 
again; some rewarm after a brief im- 
mersion, while others require a rela- 
tively prolonged immersion. Finally, 
Some people fail to show the hunting 
reaction at all and simply cool down 
to the level of the air or water. These 
individual differences have not yet 
been explained, . 
Meehan (1957) experimenting with 
daily hand cooling curves, reported an 
unusual result with one of his Ss. On 
the first 2 days of immersion, S, à 
college Student, provided identical 
20-min, cooling curves, 


were characterizeq by the usual nega- 
tively 


except that the amount of rewarming 
Was somewhat less. On the fourth 
day Meehan reported that S arrived, 

aving just completed an important 
examination, He appeared very tense 
and was visibly agitated. Neverthe- 
less, Meehan conducted the cooling 
test as usual. On this day S showed 
no vasodilatation whatsoever. The 
Cooling curve dropped monotonically 
toan asymptote at the temperature of 
the water, Furthermore, although 
the experience is normally reported a3 
painful, S experienced so much pain 


on this day that he refused to return 
for further tests, 
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COLD-INDUCED VASODILATATION IN AROUSAL 


The cold-pressor test has been avail- 
able for a considerable time as an 
index of a hyperreactive vasomotor 
system and has been studied with 
ambiguous results in connection with 
emotional states. Meehan's observa- 
tion goes considerably beyond the 
implications of the cold-pressor test, 
however. If verified, it has implica- 
tions for the problem of individual 
differences in cold acclimatization and 
Suggests an emotional basis for these 
differences. It also has implications 
for the psychophysiology of arousal. 
That is, it is conceivable that differ- 
ences in the parameters of the cooling 
Curve may be characteristic of emo- 
tional or arousal states. This study 
was initiated, therefore, to explore the 
Possibility of relationships between 

€wis waves and behavioral measures. 


EXPERIMENT I 
Method 


Subjects.—Vifty-four undergraduate college 
Students, male and female in equal number, 
enrolled in the summer session, were used as 
Ss. The Ss were paid for their participation. 

Apparatus.—Since the parameters of the 

&nd-cooling curve depend importantly on 
the body's thermal balance and since the 
unting reaction is elicited most easily when 
the body is warmer than usual, the experiment 
Was conducted in a temperature-controlled 
Chamber maintained at 90° F., 50% RH. The 
Water bath, controlled at a temperature of 
A F., was stirred slowly and automatical n 
The S sat with his right arm hanging ee 
pt his side and, during immersion, with = 
and through a port into the water up to t 4 
Wrist. No contact was made with the sides o 
the Water bath by the hand nor of the chair 
With any part of the arm (soas not to influence 
Circulation). The end of a 24-gauge copper- 
Constantan thermocouple taped to the index 
nger Provided a continuous temperature 
“Urve on a Leeds and Northrup ""Speedomax. 
N externa] switching device permitted = 
| ent of water temperature from a secon 
tq MOcouple taped to the same finger EI 
heprotrude .5 in. beyond the nail. Elec d 
el Ock was provided with an inductorium an 
sigo Odes placed on the ventral an E : 
“SS of the right arm just below the elbow. 


& 
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Shock levels, preselected by E as strong and 
very weak were calibrated by measuring the 
output of the inductorium at different posi- 
tions through a 151,000-ohm load. Through 
this load a strong shock was 1,660 v., peak 
current of 11 ma.; weak shock was 242 va 
peak current of 2 ma. Shock duration, con- 
stant at 2.0 sec. was controlled with an 
electronic interval timer. 

Procedures.—To approach face validity in 
the sense of a genuine stress, Ss were told that 
they were participating in the evaluation of a 
harmless, pain-killing drug. They were pro- 
vided with a 5-point rating scale with which 
they were to report intensities of pain when 
they felt a change in pain status. They were 
also told that the cold immersion was a 
standard pain test and that in addition to 
evaluating the drug in regard to it, it was 
desired to evaluate the drug's effectiveness in 
reducing the pain of electric shock. 

The experiment was carried out in four 
successive, daily, test periods. Each S was 
always run at the same time of day. Each 
test period required S to sit quietly for 20 min. 
prior to immersion in order to raise body 
warmth level. Ata verbal signal he immersed 
his hand. The S was in isolation in the 
chamber, but could make his verbal reports 
or ask to be released with an intercom which 
carried his voice to E who watched him 
through a one-way window. 

The Ss were assigned at random into a 
strong-shock group, a weak-shock group, and 
a no-shock group. The two shock groups 
received shocks on the second and third 
experimental days 15 min. after immersion of 
the hand into the water. On the second and 
third days, and for the no-shock group on all 
days, the electrodes were not attached. A 
"drug" consisting of a small measure of 
colored water was taken orally on the second 
and third days. The Ss knew every detail of 
the experimental procedures except when the 
shock would be administered during immer- 
sion and that they were being given the same 
substance as drug on the second and third 
days. They were told that a drug and a 
placebo were being administered in balanced 
fashion on those 2 days but not told when 


each was given.? 


Results 
Of a total of 54 starting Ss, 13 were 
not completed. Of these, 7 were lost 


? After completion of the experiment, Ss 
were informed of the actual nature of the 
experiment. 
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before any immersion and 6 quit dur- 
ing or after the first immersion. These 
last 6 Ss will be described briefly 
below. 

Since it was desired to classify Ss 
according to the nature of their cool- 
ing curves, but independent of the 
experimental treatments, an examina- 
tion was made of the distributions of 
the parameters of the curve obtained 
on the first day. Among the param- 
eters examined were the latency of 
vasodilatation, the rate of cooling to 
vasodilatation, the minimum finger 
temperature, maximum rewarming, 
rate of rewarming, and length of cycle. 
A vasodilatation was defined as a 
sudden warming of the finger of at 
least 3° F. A number of correlations 
were suggested, but the measures 
which served to discriminate among 
Ss most clearly were the latency of 
vasodilatation and the cooling rate to 
vasodilatation. This is in agreement 
with a similar analysis performed by 
Yoshimura and lida (1950). Since 
latency can be obtained from the tem- 
perature records by inspection, it was 
used for classification. 

Using latency to vasodilatation 
during the first exposure, the 41 com- 
pleted Ss were classified into three 
groups: (a) those with a latency of 
<270 sec., (b) those with a latency of 
270-400 sec. inclusive, and (c) those 
with a latency of > 400 sec. The 
resultant distribution of Ss in the ex- 
perimental design is shown in Table 1, 

Using latency of vasodilatation as 


TABLE 1 


NUMBER OF Ss IN Exr. I 


< 270 | 270-400 


> 400 N 
No shock 4 8 6 18 
Weak shock 2 6 5 13 
Strong shock 4 5 1. 10 
N 10 19 12 41 
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DAYS 


Fic. 1. Latency of vasodilatation of 
the 270-400 sec. group. 


the dependent measure presented @ 
logical problem. If the effect of the 
treatments were to reduce the latency» 
then the < 270-sec. group was re- 
stricted by already having a very 
short latency. Conversely, the > 400- 
sec. group contained Ss who never 
showed a Lewis wave and if the effect 
of the treatments were to be an 10- 
crease in latency, then this grouP 
could show no change. Thus, the 
clearest effect of the treatments WA 
most likely to be seen in comparison? 
within the 270-400 sec. group. These 
data, therefore, are those to be 
presented, 

Figure 1 presents the mean time tO 
vasodilatation of the three treatment 
groups on each day. The figure shows 
that the duration of vasoconstrictio? 
was reduced in the control group after 
the first experience. In the weak- 
shock group it was increased on Day ^ 
but reduced thereafter although * 
never reached the latency of the ne 
shock group. The high-shock group 
started with a slightly shorter latency 
on Day 1 than the other two grouP ^ 
but exhibited a marked prolongat!© E 
of vasoconstriction on Day 2, 2 
crease of about 100 sec. between s 
The latency on the remaining 2 t 
was shorter than that of Day 2» sid 
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sull represented a great increase over 
Day 1 and over the other two groups. 
Figure 2 presents the same results 
using rate of cooling to vasodilatation 
as the dependent variable. Since this 
measure takes account of both latency 
and temperature, it must be inter- 
preted differently than either alone. 
That is, here a decreased rate repre- 
sents an increased or prolonged vaso- 
constriction; a decreased or shortened 
vasoconstrictive effect is shown by an 
increased rate. As may be seen, using 
this measure which smoothed the data 
considerably, allows generalization of 
the latency results to cooling rate. 
The pain reports elicited from Ss 
were highly variable and difficult to 
interpret after the first day possibly 
because of the administration of shock 
after that day. The mean pain ratings 
obtained on the first 15 min. of the 
first day are shown in Fig. 3 and here 
it may be seen that reported pain 
decreased with continued immersion 
for all groups. The > 400-sec. group 
Consistently reported greater pain 
than the other two groups- The other 
two groups experienced essentially the 
Same amount of pain. . 
Earlier we noted the loss of six 5s 
Who quit during or after the first day's 
test. Thus, none of these Ss experi- 
enced the electric shock. All of them 
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Fic. 3. Mean pain ratings of the three 
Lewis wave groups as a function of immersion 
time. 


were immersed for at least 400 sec. 
Every one was found to be character- 
ized by three things: (a) a complete 
absence of vasodilatation during the 
first day's immersion, (b) a self- 
initiated withdrawal from the experi- 
ment, and (c) a current or recently 
experienced intense, real life, emo- 
tional experience. 


EXPERIMENT ll 


The purpose of this experiment was 
to extend the previous one by examin- 
ing the relationships between indivi- 
duals classified by first exposure 
latency to vasodilatation and perform- 
ance in an independent situation in 
which performance might be thought 
of as sensitive to arousal. 


Method 


A conflict-uncertainty task was developed 
using preliminary experimentation to select 
its procedural and apparatus characteristics. 
The apparatus was a small panel containing 
six doorbell buttons arranged in a circle. A 
seventh button in the center of the circle 
turned on a pilot light at the top of the panel. 
The S was instructed to turn on the light with 
the center button and to turn it off with one of 
the peripheral buttons. He was told that on 
the first trial all of the six buttons would turn 
the light off; on the second trial all but one 
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would work; on the third trial all but two, 
etc., until only one button worked. He was 
also told that the arrangement of wrong 
buttons would not be consistent from trial to 
trial. Actually, all buttons always turned off 
the light. 

Just before the first trial S was given a 
strong, 2-sec. shock defined as in Exp. I, 
through electrodes attached to the wrist of the 
working hand. He was informed previously 
that this was a sample of the shock he would 
receive each time he pressed a wrong button. 
He was informed that he would accumulate a 
sum of money throughout the test regardless 
of errors, starting with 5 cents on the first 
trial, which it will be recalled had no risk, and 
cumulating to $1.00 if he completed all trials. 
He was told that he could quit at any time 
and keep his accumulated earnings. Since, 
although unknown to S, all buttons were 
correct on all trials, no S ever received more 
than the sample shock. The first trial began 
60 sec. after the shock. In addition to the 
procedures described, continuous recordings 
were made of the index-finger temperature of 
the nonworking hand. 

The hand-cooling procedures were carried 
out as before with a 20-min. equilibration to 
90° F., 50% RH, followed by a 20-min. water 
immersion at 34° F. No shock or electrodes 
were involved. 

Subjects.—Thirty-three male, undergrad- 
uate students were used as paid Ss. None of 


these had had any prior experience with any 
of the procedures used. 


Results 


Nine Ss vasodilated between 270 to 
400 sec. ; 24 vasodilated at > 400 sec. . 
none vasodilated at < 270 sec. Al. 
though most of the 33 Ss required only 
a few minutes to complete the task, 
one S of the > 400-sec. group, re- 
quired over an hour. This S's data 
will be presented separately. The 
remaining Ss are shown in Fig. 4 in 
terms of mean time to select among 
the six buttons and mean finger tem- 
perature of the nonworking hand. 
The finger temperatures shown are 
for 30 sec. before the sample shock, 30 
sec. after it, and at the time of each 
decision. 

The results shown in the upper part 
of Fig. 4 indicate that the > 400-sec, 
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Fic. 4. Finger temperature (top) ven 
decision time (bottom) of the two Lewis wav 
groups during the conflict-uncertainty task. 


group started with a higher finger 
temperature. With shock the 7 400: 
sec. group dropped to and remaine 
at a lower finger temperature than the 
other group which indicates a greater 
rate and amount of vasoconstriction: 
Both groups showed a progressively 
increasing vasoconstriction as the task 
progressed. ; 
Inspection of the decision times 1? 
the lower half of the figure shows that 
the > 400-sec. group started with 2 
slightly longer decision time than di 
the other group, but with the first 
involvement of risk (Trial 2) dns 
group showed a marked decrease 1” 
decision time. After the second trial) 
the curve rises steeply so that at first 
it still represents a faster decision tim? 
than for the 270-400 sec. group, hus 
later it crosses the curve of this grouP 
and as the risk increases further: ! 
rises to a maximum of over 17 SCC 
and then drops slightly. The decisio" 
times of the 270-400 sec. group, 0t = 
other hand, showed a more-or-les® 
continuous decrease with increasing 
tisk, and on the last trial, not unlike 
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the > 400-sec. group, they showed an 
end effect. 

One S in the last experiment, not 
included in the above results, required 
over an hour to complete the task. 
His data are shown in Fig. 5. These 
data are very interesting although not 
completely consistent with the previ- 
ous ones. . 

Note that the ordinate of Fig. 5 is 
in minutes rather than seconds. T he 
S started with a slow decision time. 
His decision times then increased 
linearly until the third trial. — This 
represents a large increase relative to 
the previous figure- On the fourth 
trial it took him over 14 min. to make 
a decision, on the fifth trial over 20 
min. Looking at the finger tempera- 
ture of this S, there is some suggestion 
of a mirror-image relationship al- 
though with a time lag. In any case, 
it is clear that as finger temperature 
Teversed, decision time showed aide 
crease, although with a lag. This ; 
Started with a finger temperature o 
88.2° F, as compared to 88.0° F. for 
the mean of the > 400-sec. group in 
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the last figure. However, the mini- 
mum temperature reached for that 
group was 84.4° F. whereas the mini- 
mum of this S was 80.6° F. 


DISCUSSION 


The results of Exp. I appear very clear 
in supporting Meehan's (1957) observa- 
tion that cold-induced vasodilatation 
may be influenced by emotional stress. 
That experiment also suggests that the 
latency of vasodilatation can be in- 
creased by the threat of shock and that 
the degree to which the latency is affected 
depends on the intensity of the threat. 
The continuation of effect to the fourth 
day by both shock groups, even though 
they were assured that they would not be 
shocked on that day, suggests a condi- 
tioning phenomenon, and, to the extent 
that this can be verified, suggests the 
possibility of a more or less lasting 
alteration of the cooling curve. On the 
other hand, the results suggest that in 
the absence of additional stress, Ss 
habituate to the cold water experience. 

When the results of the treatment 
effects of Exp. I are considered along with 
the characteristics of the dropout Ss of 
that experiment and along with related 
observations (Meehan, 1957; Teichner, 
1963), the hypothesis is suggested that 
ability to adapt (latency) to cold stress 
under conditions which permit a reflex 
vasodilatation is related to a relatively 
high chronic arousal characterized by a 
tendency to vasoconstrict under emotion- 
producing stresses. A direct source of 
support for this hypothesis comes from 
results showing a delayed indirect vaso- 
dilatation in schizophrenics as compared 
to normals (Henschel, Brozek, & Keys, 
1951) and vasomotor habituation of the 
orienting response (Sokolov, 1960; Unger, 
1964). Related support may also be 
found in other studies which have re- 
ported inadequate peripheral vascular- 
ization in schizophrenics (e.g., Abram- 
son, Schloven, & Katzenstein, 1941; 
Shattock, 1950) and experiments finding 
vasoconstriction following induced emo- 
tional states (Hovland & Riesen, 1940; 
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haps it should be mentioned here that 
Broadbent’s original findings, utilizing 
lists of digits, were replicated at Indiana 
University in 1962 using the same equip- 
ment as was used in the present experi- 
ment.) Thus, even if Broadbent's 
estimate of the time required to switch 
attention from channel to channel were 
revised, his theory as it now stands would 

not be able to account for the observed 

results of this experiment. 

It is quite possible that telling half the 

Ss in the present experiment that if they 

switched channels in a specified way 

some of their answers might make sen- 

tences may have given them some sort 

of set which enabled them to listen to the 

words all at once in some way and then 

rearrange them to get sentences before 

responding. In fact, many of the Ss did 

manage to rearrange a number of the 

SCR lists and get sentences out of them. 
In this regard it is interesting to note 


that after the experiment was over 
several of the Ss who had received the 
“channel” instructions reported spon- 
taneously that some of the lists made 
sentences even though they should not 
have been switching channels at all. 
Again this indicates that these Ss were 
attending in a manner which Broadbent's 
theory says they should be unable to do, 
at least at the rate of presentation used 
in the present experiment. 
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SUPPLEMENTARY REPORT 


EFFECTS OF CS AND UCS CHANGE ON EXTINCTION 
OF THE CONDITIONED EYELID RESPONSE 


LOUIS E. PRICE, DAVID W. ABBOTT, AND WILLIAM E. VANDAMENT? 


Unicersity of Massachusetts 


Ss were given 20 extinction trials, 


following 70 acquisition trials on 


continuous reinforcement, under 1 of 6 combinations of CS and UCS 


conditions. 


tended, and the UCS was delayed or omitted. 


The CS was unchanged from acquisition, shortened or ex- 


The CS conditions did 


not have a significant effect on extinction while the delayed UCS led to 


slower extinction than the omission condition. 


interaction. 


McAllister (1953) and Reynolds (1958) 
have both reported that CRs extinguish faster 
under a UCS omission procedure than with 
the technique of extending the CS-UCS 
interval to 2,500 msec. following a partial- 
reinforcement schedule during acquisition. 

Spence, Rutledge, and Talbott (1963) and 
Reynolds (1958) found that following con- 
tinuous reinforcement there were no signifi- 
cant differences in extinction rate with the 
UCS delay vs. UCS omission procedures. 

Spence et al. discuss these results in terms 
of a discrimination hypothesis. They point 
out that the ability of Ss to discriminate the 
change from acquisition to extinction is à 
function of the degree of stimulus change 
from acquisition to extinction. Following this 
interpretation, the UCS delay procedure 
should lead to slower extinction since the UCS 
omission would provide a major, stimulus 
change from acquisition to extinction. Fol- 
lowing a partial-reinforcement schedule, the 
UCS delay in extinction should be an even 
less discriminable change since in nonrein- 
forced trials during acquisition Ss would have 
experienced this same UCS condition. Fol- 
lowing 100% reinforcement in acquisition, it 
Would appear that both UCS omissions an 
UCS delay are discriminable enough changes 
to lead to rapid extinction since neither 
condition had been present during acquisition. 

Spence (1963) points out another stimulus 
variable which may affect the discriminability 
of the change from acquisition to extinction. 
This is the duration of the C» When the 
UCS is delayed in extinction the CS duration 
is increased from its acquisition interval to 
overlap with the UCS. He suggests that this 
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There was no CS-UCS 


factor following continuous reinforcement for 
the UCS delay extinction may be as effective 
a cue for Ss that things have changed as the 
UCS omission is. In support of this notion, 
he reports that UCS delay does lead to slower 
extinction than UCS omission following con- 
tinuous reinforcement if the CS is of long 
duration during both acquisition and ex- 
tinction, thus eliminating the cue of CS 
change. 

The present study was designed to further 
evaluate the effects of CS and UCS change 
from acquisition to extinction following con- 
tinuous reinforcement. Three CS conditions 
were combined factorially with two UCS 
conditions. The CS was shortened, length- 
ened, or unchanged in going from acquisition 
to extinction while the UCS was either de- 
layed or omitted in extinction. 

Method.—Forty-eight men and 48 women 
from psychology courses at the University of 
Massachusetts served as Ss. In addition, one 
man was discarded from the study due to 
apparatus failure. 

A dental chair for Ss was located in a 
semidarkened room adjoining the room con- 
taining the recording apparatus and stimulus 
controls. The CS was located 50 in. in front 
of S and consisted of an increase in the 
illumination of a 6-cm. wide circular milk- 
glass disk from 0.032 to 1.364 apparent foot- 
candles (ftc.). The UCS was a corneal air 
puff of 2-psi intensity. Background noise 
from an electric fan prevented Ss from hearing 
any apparatus noise. The intertrial interval 
was controlled by a Gerbrands programmer 
while CS duration, CS-UCS interval, and UCS 
duration were controlled by a series of Hunter 
timers. A record of eyelid movement, via a 
microtorque potentiometer, an amplifier, and 
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TABLE 1 
MEAN NUMBER OF CRs DURING 10-TRIAL BLOCKS or EXTINCTION TRIALS 


cs 
Trial Blocks Shortened Unchanged Lengthened 
Omitted UCS | Delayed UCS | Omitted UCS | Delayed UCS | Omitted UCS | Delayed UCS 
à ES 28s 188 | Er K^ $31 


pen motor, and a record of stimulus onset 
were obtained with a pen recording system 
using a paper speed of 120 mm. per sec. 

Once S was seated in the dental chair and 
the recording apparatus had been attached, 
he was read the standard instructions which 
were designed to create a relaxed, passive 
attitude. Each S was then given 70 acquisi- 
tion trials and, with no delay other than the 
usual 15-sec. intertrial interval, 20 extinction 
trials. During acquisition the CS duration 
was 550 msec. with a CS-UCS interval of 500 
msec. and a UCS duration of 50 msec. For 
the extinction trials Ss were randomly as- 
signed to one of the six extinction conditions, 
within the restriction that half of each group 
be males and half be females. The extinction 
groups were orthogonal combinations of three 
CS conditions and two UCS conditions, The 
UCS was delayed to 2,500 msec. or omitted 
while the CS duration remained unchanged 
at 550 msec., was shortened to 350 msec., or 
was extended to 2,550 msec. Responses fall- 
ing in the 200-500 msec. range from CS onset, 
with a recorded deflection of 1 mm. or more, 
were counted as CRs during both acquisition 
and extinction. 

Results and discussion —Al groups were 
responding at about 84% CRs during the last 
block of 10 acquisition trials? A factorial 
analysis of variance was carried out on the 
mean number of CRs for the last 10 acquisi- 
tion trials with CS duration and UCS condi. 
tions as variables to check that groups had 
reached the same level of performance prior 
ron P PN ted on the qa 
voluntary-form responders on the basis of the form of 
their anticipatory eyelid response (Spence & Ross 
1959). Separate analyses of both acquisition and ex: 


tinction data were made excluding the 20 voluntaries, 


however, the results did not differ from what is reported 
in this article. 


to extinction. This analysis produced no 
significant Fs for either main effects or the 
interaction. Table 1 presents the mean 
number of CRs for each 10-trial block of 
extinction trials for each combination of cs 
and UCS conditions. An analysis of variance 
was performed on these data with CS and 
UCS conditions and sex as between-Ss vam 
ables and trials as a within-Ss variable. The 
two significant Fs were Trials, F (1, 84) 
= 8041, p <.001, and UCS Condition, 
F (1, 84) = 9.45, p < .01. 

The significant UCS condition was due t 
à slower extinction rate for the delayed uc 
procedure. This result, along with the finding 
that the CS condition was not significant 
(F <1), suggests that the possible D leve 
effect of omitting the UCS, as pointed out by 
McAllister (1953), may also be a relevant 
variable here, f 

It is clear from our results and those a 
Previous investigations that further WoT 
needs to be done to understand the effects 0 
variables which influence the rate of extinc- 
tion in the eyelid situation. 
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STIMULUS DETERMINANTS OF CHOICE BEHAVIOR 
IN VISUAL PATTERN DISCRIMINATION ! 


JAQUES KASWAN, STEPHEN YOUNG, axb CHARLES Y. NAKAMURA 


University of California, Los Angeles 


5 Ss participated in a forced-choice, pattern-recognition study. In each 
of a total of 10 sessions, Ss were required to identify tachistoscopic 
presentations from a set of 3 or 2 patterns. In separate sessions each S 
received 1 set of 3 patterns and 3 subsets of 2 patterns, the latter con- 
taining all the possible combinations of the 3-alternative set. Exposure 
durations ranged from 4 to 128 msec. It was assumed that spatial 
distance characteristics of the patterns and exposure duration were 
stimulus dimensions which help determine the perception of these 
patterns. ‘This assumption was used to predict (a) the order of response 
probabilities to each pattern at different levels of exposure duration, 
(b) the accuracy of pattern identification as a function of exposure 
duration, and (c) the response probabilities to each pattern in 2-alterna- 
tive sets from the responses to these patterns in the alternative set. 
The predictions were generally confirmed, indicating that these stimulus 


dimensions may 
behavior. 


The study of choice behavior in 
Perceptual-recognition tasks has re- 
cently been a matter of considerable 
Interest. |n these tasks, S is generally 
required to specify which member ofa 
Set of alternative stimuli (e.g., A,B,C) 
has been presented by using response 
alternatives (a,b,c) that are in one to 
One correspondence with the stimuli. 
An adequate account of such choice 
ehavior should include, first, predic- 
tions of the probability with which 


‘This study was supported by a grant from 
the National Science Foundation (G-9589) 
and a fellowship (for Stephen Young) from the 
Public Health Service (5-FI-MH-19,664-02). 
Ve wish to thank Sally Haralson and Ruth 


us for help in collecting and analyzing the 
ata, 


contribute substantially to the determination of choice 


each response alternative occurs to 
each stimulus in a given set, and 
secondly should predict how the 
response probabilities obtained from a 
particular set of stimuli (e.g., the set 
A,B,C) are related to a subset of this 
set (e.g., the stimulus set A,B with 
response alternatives, a,b). In exist- 
ing formulations, these predictions are 
based either on empirically obtained 
rules for choice behavior (Clarke, 
1957), or on logical conceptions which 
assume that choice behavior results 
from a decision process applied to 
some internal representation of the 
relations between the stimuli within 
the set of alternatives (Luce, 1963). 
These formulations make no explicit 
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Fic. 1. Sample sections of patterns and 
distance between spaces in mm. 


attempt to consider the specific role 
of physical stimulus dimensions as 
determinants of choice behavior. For 
example, consideration of stimulus 
effects is often limited to differences 
in discriminability between alterna- 
tives which might consequently be 
arranged on a similarity scale (Luce, 
1963), or the number of alternatives 
and signal-to-noise ratios (e.g., Att- 
neave, 1959; Garner, 1962). 

Some of our previous work has 
Shown that exposure duration and 
spatial arrangement of stimuli are 
important stimulus dimensions deter- 
mining pattern perception (Kaswan, 
1958; Kaswan & Young, 1963). A 
major goal of the present study is to 
explore the utility of these stimulus 
dimensions in predicting choice be- 
havior in a tachistoscopic pattern- 
recognition task. The specification of 
these variables can be illustrated with 
reference to Fig. 1, which shows 
patterns constructed on the basis of 
formulations developed in our previ- 
ous studies (Kaswan, 1958; Kaswan & 
Young, 1963). Each of these patterns 
can be defined in terms of a sequence 
of spaces, with each space delimited 
by a pair of vertical lines. Previous 
work suggests that beyond its role in 
detection, exposure duration deter- 
mines the perception of existing differ- 
ences among spaces within the pattern 
presented, according to the followin: 
assumptions: (a) If the pattern con- 
tains two adjacent spaces x and y, 
which are equal in size, these spaces 
should not be perceived as different 
in size, regardless of exposure dura- 
tion. Thus, the Ev pattern in Fig. 1, 
should always be perceived as equally 
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spaced. (b) If Space x and Space y 
differ in size, then the probability of 
perceiving them as different decreases 
with decreasing exposure duration. 
Thus, at very brief exposures, differ- 
ences in the size of spaces within the 
“3” or “1,2” pattern (see Fig. 1), are 
unlikely to be perceived and, there- 
fore, all spaces should be perceived as 
equal. (c) If Space x and Space ¥ 
differ in size, the greater the difference, 
the greater the likelihood that they 
will be perceived as different at 
a given exposure duration. Thus, 
longer presentations of the 1,2 pattern 
should be required for the perception 
of the difference between the 4- Mp 
6-mm. spacing than between the 2 
and 8-mm. spacing within this pat- 
tern. These assumptions should be 
relevant for any stimuli where relative 
distance between elements determines 
groupings like those described in the 
Gestalt literature as following the 
Principle of proximity. ) 
The purpose of this study was (4 

to test predictions derived from these 
assumptions for the relative frequency 
with which each response alternative 
would occur to each pattern as 4 
function of exposure duration when 
all three patterns are stimulus-re- 
sponse alternatives, and (b) to use 
these assumptions to translate the 
results obtained in the three-patter? 
task into predictions of the wes 
probability with which each availab! 

response alternative occurs when On * 
two of the patterns are possible st 
ulus-response alternatives (i.e., s 
Ev-1,2; 1,2-3). The specific pred! 


i i : . Its 
tions will be discussed in the resu 
section. 


METHOD 


s 
Materials.—Sections of the three pandir 
used are shown in Fig. 1, together with t 
dimensions. They were constructed Ora in 
asis of assumptions which are outline were 
Other parts of this paper. The patterns ¥ 
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drawn through the center of an 8-in. square 
field and multilithographed onto flat-white 
cardboard sheets to permit the arrange- 
ment of randomized decks of stimuli. The 
Ev, 3, and 1,2 patterns contained a total of 26, 
27, and 24 black vertical lines and their total 
lengths on the stimulus card were 63s, 67s, 
and 51i in. respectively. Each of these 
patterns subtended a visual angle of ap- 
proximately 11°. 

` Apparatus.—The patterns were presented 
with a two-field, electronically. controlled 
mirror tachistoscope. This tachistoscope is 
described in more detail elsewhere (Kaswan & 
Young, 1963). The fixation field contained 
two I-in. markers at each end of the field to 
indicate the horizontal line on which the 
stimulus appeared. 

Procedure.—In each session, one of the 
Possible combinations of the three patterns 
constituted the set of stimulus and response 
alternatives. Combinations used were the 
entire set of three stimuli (three-alternative 
condition) and each of the three possible sub- 
Sets of two stimuli (two-alternative com- 
binations), Each S received all of these 
combinations. 

After a .5-sec. warning buzzer, S presented 
the stimulus to himself by pulling à lever as 
soon as he was ready. Each of the stimuli 
used in a given session was initially presented 
for two, .5-sec. exposures in order to inform 
S of the alternatives available in the session. 
The S was requested to describe these initial 


Presentations in his own words. After correct 


descriptions were given, which was always the 
the stimulus 


names "even," "three," and “one-two” to 
identity presentations which corresponded to 
In addi- 


the Ey, tively. 
d , 3, 1,2 patterns, respectiv Y, ith the 


astimulus. 


attern and 
f practice, 
ted under each of 


ML exposure durations (4, 6, 
a 128 msec.), 5 times in eac 
ive session and 10 times in each two-alterna 
tive session, Exposure duration and pattern 
Sequence were randomized in each session for 
Sach S. There was an interval of about 11 

Sec. between presentations of each pattern. 
Each S had 10 sessions in two successivo 
locks of 5 sessions each. A block consisted 
ti 1 session of each two-alternative combina- 
‘on followed by 2 three-alternative sessions. 
N€ session sequence for the two-alternative 
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combinations was randomized within blocks 
for each S. Thus, over the 10 sessions, each 
S received 20 replications of each pattern and 
exposure duration under each stimulus com- 
bination. These were completed within 4 wk. 
Six blank cards were also randomly inserted 
for each of the 20 exposure-duration-pattern 
replications given each S, to test for false 
positive responses. 

Exposure luminance remained constant at 
about 11.84 mL. All viewing was binocular. 
The experiment was administered to individ- 
ual Ss in a dark room, except for a 6-w. light 
used by E. 

Subjects.—The Ss were one male senior and 
four male graduate students, all in psychology. 
No S was sophisticated in the area of percep- 
tion; all had little or no experience in serving 
as Ss in such experiments, and were naive as 
to the purpose of the study. 


RESULTS 


Figure 2 shows the percentage of 
each error response (solid and dashed 
lines) and the percent "nothing" re- 
sponses (dotted lines) as a function of 
exposure duration for each pattern 
under all conditions for the five Ss 
combined. Figures 2a, 2b, and 2c 
show the results for each pattern in 
the three-alternative condition. Fig- 
ures 2d, 2e, and 2f show these results 
for the two-alternative combinations. 

In order to utilize the most stable 
results, only the last 10 of the 20 
replications for each pattern and ex- 
posure duration for every S and com- 
bination were used in the analyses 
presented below. Each point in Fig. 
2 is thus based on 50 exposures. The 
error response percentages (solid and 
dashed lines) were calculated from the 
total number of responses excluding 
“nothing” responses. 

The dotted lines in each part of 
Fig. 2 show that the percentage of 
“nothing” responses was about the 
same to each pattern in every com- 
bination. All blank card presentations 
were responded to with “nothing” 
indicating that false positive responses 
can be discounted in these results. 
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and dashed curves). (Dotted curve: 
bars show the SE of the percentage 
pattern in the three-alternative condition 
in the two-alternative combinations.) i 


Representative values of the SE of 
the percentage are indicated by ver- 
tical lines at every second point of 
each error response curve in Fig, 2. 
These estimates were computed as 
vee 


where P is the percentage of 


responses shown for a given point in 
Fig. 2,Q = 1 — P, and N = the total 
number of responses excluding ‘noth- 
ing" responses, given to each pattern 


ponses to each stimulus 
s show the percent 
[see text]. 
Figures 2d, e, and f show 


as a function of exposure duration (solid 


Tcentage of "nothing" responses. ponsi 
Figures 2a, b, and ¢ show responses to cac 


responses to cach pattern 


at the exposure duration indicated bY 
the point. 


Three-Allernative Condition 


Ev pattern—Since there are 2O 
variations in spacing within the Ev 
Pattern, variation in exposure dura- 
tion should not, according to AssumP" 
tion a (see Introduction), affect 1t 
discrimination, and ideally all od 
Sponses to it should be Ev response? 


TUS 


STIMULUS DETERMINA 


As shown in Fig. 2c, there were indeed 
very few erroneous responses to this 
pattern. 

3 pattern —According to Assump- 
tion b, decreasing exposure durations 
will decrease the probability that the 
difference between spaces will be 
discriminated. Given all three pat- 
terns as possible alternatives, we 
would therefore expect a high inci- 
dence of Ev responses to the 3 pattern 
at the briefest exposures. Since de- 
creasing exposure duration is pre- 
sumed only to decrease the probability 
of perceiving existing differences be- 
tween spaces, the 1,2 response should 
hot occur to this pattern. Figure 2a 
confirms these predictions in showing 
that error responses to this pattern 
were limited largely to Ev responses. 
The upper half of Fig. 3 shows re- 
sponses to this pattern for individual 
Ss. These functions are clearly con- 
Sistent with Fig. 2a. 

1,2 paltern.—As for the 3 pattern, 
differences between spaces should not 
be perceived at the briefest exposures, 
according to Assumption b. Under 
these conditions, we expect Ev re- 
sponses to be the predominant error- 
Figure 2b (solid curve) shows this to 
Frequency of Ev re- 
, as exposure 


Pe the case. 
Sponses decreased sharply 
duration lengthened. 
Assumptions b and c imply 
large differences between spaces wil 
e the first to be perceived as a func- 
lion of increasing exposure duration. 
At exposures of intermediate length, 
therefore only the larger difference 
; „s. 8 mm.) in the 
to be perceived, 
suppose 


that 


between spaces (4 V 

:2 pattern is likely 
and it seems reasonable to 
that this perception more closely ap- 
Proximates the 3 pattern than the 
Other alternatives. ln any €ase, Ev 
IS not a probable response under these 
Conditions, because spacing differ- 
“Nees should be perceived, $0 that the 
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3 response should be the predominant 
error response at exposures of inter- 
mediate length. Accordingly, the 3-re- 
sponse function to the 1,2 pattern 
should look like an inverted V, first in- 
creasing, then decreasing with length- 
ening exposure duration. The 3-re- 
sponse curve in Fig. 2b (dashed line) 
is quite flat; however, this result is 
attributable to considerable individual 
differences in the exposure duration 
required for the emergence of the 
3-pattern response. The lower half of 
Fig. 3 shows the responses to the 1,2 
patterns in the three-alternative con- 
dition for each of the five Ss. (Sub- 
jects 2,3,5 did not detect the 1,2 stim- 
ulus at 4 msec. This is indicated in 
Fig. 3 by breaking the initial portion 
of the solid curves for these Ss.) For 
all Ss the percentage of Ev responses 
(solid lines) drops off fairly quickly 
with lengthening exposures. As ex- 
pected, the modal error percentage for 
the 3 response (dashed lines) occurs, 
in each case, at a longer exposure than 
the mode for the Ev response. Thus, 
the error curves for individual Ss are 
consistent with the assumptions. 


Two-Alternative Combinations 


The results are shown in Fig. 2d, 
2e, and 2f. These figures were drawn 
so as to be directly comparable to the 
three-alternative results. Note that 
the error curves within Fig. 2d, 2e, and 
2f are derived from different two- 
alternative combinations. For ex- 
ample, the 3 pattern was a stimulus 
in the 3-Ev as well as the 3-1,2 com- 
binations. Accordingly, the Ev and 
1,2 error response curves to the 3 
pattern shown in Fig. 2d derive from 
responses given in these different 
combinations. Also, since the 3 pat- 
tern occurred in two combinations, 
two distributions of "nothing" re- 
sponses were obtained, as shown in 
Fig. 2d. Responses to the 1,2 and Ev 
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patterns are shown in Fig. 2e and 2f, 
respectively. 
_ Assumptions for the three-alterna- 
tive condition led to the prediction of 
à 3-point, temporally determined or- 
dinal scale of probable perceptions for 
the 1,2 pattern. That is, the proba- 
bility of perceiving this pattern as 
evenly spaced was considered maximal 
at the briefest exposures, leading to a 
high proportion of Ev responses. At 
exposures of intermediate length, only 
the largest difference between spaces 
was likely to be perceived, leading to 
responses as the most likely error. 
At longer exposures, when all differ- 
ences between spaces are likely to be 
perceived, 1,2 responses should be 
maximal. Perception of the 3 pat- 
tern can be described by the first 2 
Points on the above scale. Thus, 
what is perceived in the three-alterna- 
tive condition presumably approxi- 
mates the available alternatives, but 
the perceptions as such should be 
independent of available responses. 
In fact, perceptions of the 1,2 and 3 
patterns at some durations in the two- 
alternative combinations should not 
approximate the spacing character- 
istics of available alternatives. For 
example, both the 1,2 and the 3 
Patterns should often appear as evenly 
Spaced at brief exposures, but the Ev 
Pattern is not an alternative in the 
1,2-3 combination. We propose a 
Simple decision rule that will operate 
under these conditions: S will choose 
the available alternative which is 
Closest to what he perceives along 
e temporally determined perception 
Scale specified above. . 
. 1,2 pattern —In the 1,2-3 combina- 
tion, perceptions of the 1,2 pattern 
“Pproximating the Ev pattern will 
€ad to 3 responses rather than 1,2 
Tesponses, according to the decision 
Tule. In this combination, 3 responses 
to the 1,2 pattern thus come from 
© presumed percepts. One is from 
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the Ev perception just discussed. The 
other source of 3 responses derives 
from responses to exposures of inter- 
mediate length (see above). The 
proportion of 3 responses to the 1,2 
stimulus in the 1,2-3 combination 
should, therefore, equal the sum of 
3 and Ev responses to the 1,2 stim- 
ulus obtained in the three-alternative 
condition. 

The dashed curve in Fig. 2e shows, 
in fact, a higher percentage of 3 
responses to the 1,2 stimulus in the 
1,2-3 combination than in the three- 
alternative condition (dashed curve, 
Fig. 2b). Theassumed addition of Ev 
and 3 responses will be discussed 
further below. 

In the 1,2-Ev combination the 3 
response is not an available alternative 
at intermediate exposures, when only 
the largest spacing difference is pre- 
sumed to be perceived. Under these 
conditions the 1,2 response should be 
selected because it is the only alterna- 
tive containing some spacing differ- 
ences. Accordingly, the 1,2 pattern 
should be correctly identified, except 
when it is perceived as equally spaced. 
The proportion of Ev responses should 
therefore be the same as that obtained 
in the three-alternative condition. 
Comparison of Fig. 2e and 2b (solid 
lines) shows, in fact, that the Ev 
response functions to the 1,2 stimulus 
were very similar in the 1,2-Ev and in 
the three-alternative condition. In 
both cases Ev responses occurred 
largely at brief exposures. 

3 paltern.—Assumption b leads to 
the same response prediction for this 
pattern in the 3-Ev combination as 
in the three-alternative combination 
since, as before, this pattern is per- 
ceived either correctly or as equally 
spaced, depending on exposure dura- 
tion. The solid lines in Fig. 2a and 
2d show that the two functions are 
indeed very similar, except at the 
briefest exposures, where there were 
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TABLE 1 


2 ae eae 
OBTAINED AND PREDIC ERROR RESPONSE 


PROPORTIONS TO EACH STIMULUS IN 
Two- AND THREE-ALTERNATIVE 
COMBINATIONS 

Two-Alt. 
Three-Alt. | Two-Alt. | Prediction 
SR Obt. Prop. | Obt. Prop. | — 

Ours | CRR 

3-Ev 426 376 426 | .439 
Ev-3 057 112 057 | .050 
Ev-1,2 -006 019 -006 | .006 
1,2-Ev -180 -230 .180 | .222 
3-1,2 .027 .084 .027 | .046 
1,2-3 .188 409 .368 | .229 


more Ev responses in the three- 
alternative condition. 

In the 3-1,2 combination, percep- 
tion of the 3 stimulus as evenly spaced 
should lead to 3 responses according 
to the above decision rule. Asin the 
three-alternative condition, no 1,2 
responses to this pattern should occur, 
so that in this combination no in- 
correct responses should occur to the 
3 pattern. Figure 2d (dashed curve) 
shows that few error responses were, 
in fact, obtained. 

Ev pattern.—As in the three-alterna- 
tive condition, no errors should occur 
to the Ev pattern in the two-alterna- 
tive combinations. Figure 2f shows 
somewhat more error responses to the 
Ev stimulus in the 3-Ev and 1,2-Ev 
combinations than were obtained in 
the three-alternative condition, sug- 
gesting a slight response bias. Errors 
to the Ev pattern (Fig. 2f) are still 
negligible relative to errors to the 3 
and 1,2 patterns (Fig. 2d and 2e), 

Table 1 shows another way of com- 
paring the results of the two- and 
three-alternative conditions. The 
first column of this table identifies 
each possible error response to each 
pattern in either the two- or three- 
alternative conditions. The second 

column shows the obtained propor- 
tion, pooled over the five lowest eX 
posure durations, of each possible 
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error response to each stimulus. pat- 
tern in the three-alternative condition. 
The third column contains similarly 
obtained error response proportions to 
each pattern in each two-alternative 
combination. The fourth column 
lists predictions of response proba- 
bilities in the two-alternative com- 
binations made on the basis of the 
above discussion. To summarize 
briefly, we predicted that error re- 
sponse proportions in the two-alterna- 
tive combinations should be identical 
with those in the three-alternative 
condition except for the 3 responses p 
the 1,2 stimulus. The proportion of 
responses given the 1,2 stimulus 1n the 
1,2-3 combination should be the sum 
of Ev and 3 responses (.180 4- 188, 
see Column 2 of Table 1) given the 
1,2 stimulus in the three-alternativy 
condition. Results shown in Table 
indicate that all of our predictions 
come within 5%, of the obtained 
results. " 
Table 1 is also of interest because 
our predictions from the three- to the 
two-alternative conditions can 9€ 
compared to those which follow Ipan 
the Constant Ratio rule (CR Rh 
This rule was first developed bY 
Clarke (1957) and can be derived from 
Luce's (1963) choice theory. A saei 
ber of studies (e.g., Hodge & Pallets 
1962) have demonstrated good a 
dictive power for this rule. The CR 
asserts that the ratio of any dw 
response probabilities to a i pee 
stimulus is a constant regardless of th l 
number of possible alternative. aR 
sponses to the stimulus. The CR 7 
Predictions must be obtained anp 
ically from the responses to a m e 
Set of stimulus alternatives, and ed 
tule is not concerned with the po 
ulus determinants of these response". 
Thus, the obtained responses show? e" 
Table 1 for the three-alternative [d 
dition were used to predict respon? 
to each pattern in every two-alter™ 
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tive combination.? Column 5 of Table 
1 shows these CRR predictions. On 
the basis of logical and empirical 
considerations the CRR predictions 
should be as accurate as ours except 
for prediction of the percentage of 3 
responses to the 1,2 pattern in the 
1,2-3 combination. 

Here, the obtained results closely 
approximated our prediction that the 
3 responses should correspond to the 
sum of Ev and 3 responses obtained 
to the 1,2 pattern in the three- 
alternative combination. The CRR 
prediction in this case was off by about 
18%. 

DISCUSSION 


present study 
simple as- 


. The results of the 
indicate that use of rather 
sumptions about spatial and temporal 
characteristics which determine the per- 
ception of stimuli can effectively predict 
choice behavior. This formulation can 
Predict response probabilities to each 
stimulus from master to subset (three to 
two-alternative combinations) as well or 
better (in one instance) than the CRR 
rule. Further, it is difficult to see how 


? The predictions from Clarke’s rule were 
obtained by using a derivation from the rule 
given by “Anderson (1959). The derived 
ormula reads as follows, 

P(rils)vt 

S P/M 
reS 
That is, P(r;/s)s, the probability of response 
ři to stimulus s; in the subset, S, is equal to 
P(ri/s) in the master set, M, divided by the 
Sum of the probabilities obtained in M for all 
responses to s which were possible in 5. As 
applied to our data, the predicted CRR pro- 


P(ri/s)s = 


Portion of 3 responses to the 1,2 stimulus 
Would be 
PG/1,2) 790 
Alternative 
= [ P(3/1,2) 
e pa : 
PGAD PUR) - Atermative 


Whereas our prediction would be, 
P(3/1,2)Two 


Alternative 


= [P(3/1,2) + P(E/1,2) Ithree |, - 


Alternative 
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existing formulations of choice behavior 
could predict the directional, nonsym- 
metric order of responses which we 
predicted and obtained (e.g., more 3 
responses to the 1,2 pattern than 1,2 
responses to the 3 pattern). To be sure, 
“response tendencies" could be used as 
separate post-hoc descriptions of each 
asymmetric response tendency obtained, 
but such notions are hardly useful for the 
a priori specification of these stimulus- 
response relations. 

One advantage of the use of these 
stimulus-derived as compared to em- 
pirically obtained response predictions is 
that the former postulate more specific 
behavioral determinants of choice be- 
havior. Attention to such stimulus vari- 
ables in decision theories might therefore 
not only increase the generality of their 
admittedly powerful formulations, but 
help clarify the behavioral bases of the 
decision process itself. 
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DISTRIBUTION AND SEQUENCE EFFECTS IN JUDGMENT ! 


ALLEN PARDUCCI AND ARTHUR SANDUSKY 
University of California, Los Angeles 
In a visual-recognition task, the proportion of trials for which a light 


flashed in the left of 2 closely spaced positions was either 32, 5, or S. 
The proportion of correct identifications of each alternative varied in- 


versely with its proportion of presentation. 
could be deduced from the 1st-order stimulus dependencies, 
more accurate following stimulus alternations tha 


repetitions. This suggests that the successive stir 
effective stimuli. The results appea 


studies of signal detection. 
given, the sequential effects 
model for signal detection. 
include a criterion band, des 
dependencies. 


Judgment implies comparison. 
"Good" and "bad," “large” and 
"small" compare what is being judged 
with some contextual standard. The 
child is told that he is "bad" when 
his behavior is worse than it has been 
in the past or worse than the behavior 
of other children. For Such value 
judgments, the particular context that 
provides the standard may be very 
complex. For simple perceptual phe- 
nomena, however, various contextual 
theories of judgment account for the 
major effects of the relevant physical 
context (e.g., Helson, 1959; Johnson, 
1955; Parducci, 1963). 

These contextual theories predict 
the judgments from the stimulus dis- 
tribution, the relative frequencies with 
which the different physical values are 
presented. For example, a stimulus 
at the midpoint of the range is judged 
“large” if the smaller stimuli are 
presented more frequently, “small” if 
the larger values are presented more 
frequently. But when presentation is 


1 This research was supported by a grant 
(M-5465) from the National Institute of 
Mental Health, United States Public Health 
Service. Special thanks are due Norman 
H. Anderson for a critical reading of the 
manuscript. 


Most of this variation 
Ss being 
n following stimulus 
mulus pairs were the 
r inconsistent with the data from 


Except when informational feedback is 
also appear inconsistent with a learning 
A simple Thurstone model, elaborated to 
cribes both the stimulus and the response 


successive, one stimulus at a time, the 
overall distribution may not have the 
same effects upon the judgment of an 
early stimulus as on a later one. The 
end points of the scale of judgment 
cannot be established until the stim- 
ulus extremes have been presented, 
and the effects of these extreme values 
may diminish as a result of subsequent 
presentations. More generally, the 
judgments of any stimulus may de- 
pend upon the particular stimuli that 
precede it, 

Contextual theories have predicted 
the mean judgments averaged over 
the entire series, ignoring possible 
trial-to-trial changes. However, these 
changes may also reflect the effects of 
the stimulus distribution. Sequential 
changes may even explain overall dis- 
tribution effects, 

The evidence for trial-to-trial se- 
quential effects is sparse for category 
judgments (e.g, Garner, 1953; Par- 
ducci & Marshall, 1962), Typically, 
the stimulus extremes are presente 
early in the series, stabilizing the 
Judgments so that later shifts are 


relatively small. Since there are 
usually at le 


z ast five different stimuli, 
the different subsequences may not 
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occur with sufficient frequency to per- 
mit assessment of sequential effects. 

Preliminary research developed a 
simplified task in which the scale of 
judgment is never stabilized (Par- 
ducci, 1964). With just two stimuli, 
enough data could be obtained to 
permit more adequate analysis of the 
effects of different subsequences of 
stimuli. Lifted weight, visual bright- 
ness, and both loudness and pitch 
yielded similar results: judgments 
appeared to be largely determined by 
the two most recent presentations, 
and Ss were more accurate on stimulus 
alternations than on stimulus repeti- 
tions. The present report is of a more 
extensive study using judgments of 
visual position. 


METHOD 

Stimulus display.—The experimental room 
was completely dark, except for stimulus 
lights displayed on a vertical panel, 5 ft. in 
front of S. ‘The panel had three iin. holes, 
each centered in front of a 1-W. neon bulb, 
Type 45, in series with 20,000-ohm resistance. 
Two of the lights were j in. apart on a hori- 
zontal line, approximately at the eye level of 
the seated S. The third was 11 in. higher and 
centered to complete an isosceles triangle with 
the other two, forming an angle of approxi- 
mately 3? at the apex. . 
. Programing -— recording.— The upper 
light was always on, providing @ constant 
reference point for S. The two lower lights 
Were activated in predetermined sequences, 
with one or the other flashing every thir 
second for 185 msec. Sequences of 300 pres- 
entations were controlled by punched tape, 
the tape reader closing the alternative circuits 
at intervals set up by motor-driven cams 
The relative position of each flash was. in 
dicated by S who pushed a toggle switch 
either to the left or the right. — B 
reader and toggle switch Were connecte o ar : 
event recorder which provided a recor x 
the successive stimuli and the associate 
responses. 

Instructions —After being 
experimental room, S was rea 
instructions: 


seated in the 
d the following 


ur ability to discrim- 


Ia i o 
m measuring y D. appear in two 


inate between lights whic 
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different lateral positions. The room will 
be darkened, and you will see this fixed dot 
of light which will remain on throughout 
the experiment. Once every 3 seconds, a 
second light will flash briefly below this dot, 
in one of two fixed positions. Your task is 
to judge in which of the two positions the 
lower flash appears. If the flash were in the 
left position, you would push this lever to 
the left; if in the right position, to the right. 
The purpose of the constant light is to help 
you keep your general spatial orientation 
in the dark; it does not necessarily bisect 
the positions you are judging; rather, it 
appears in a different relationship to these 
two positions for different people. Head 
movements, for example, are likely to 
change the apparent relationship between 
the top light and the two flash positions. 
Your performance will be most accurate if 
you try to fix your attention on the general 
region where the flashes will occur, using 
the top light to keep your eyes from wander- 
ing between flashes. Always respond to 
each flash of light before the next flash 
occurs—even if you have to guess, which, of 
course, you will have to do for the very first 
presentation. Although it is a fine discrim- 
ination, you will be able to do better than 
chance. The whole series will take 15 


minutes. 


Conditions.—Three distributions of stimuli 
were used, the left light being presented with 
probability .2, .5, or .8. Two randomized 
orders of presentation were used for each 
schedule, and the left presentations were 
balanced in each of the two blocks of 150 pres- 
entations. The number of alternations be- 
tween left and right stimuli were also fixed at 
the expected value for each block, 75 for the 
.5 and 30 for the .2 and .8 distributions. The 
number of homogeneous stimulus runs of 
different length corresponded closely to the 
expected values for each of the blocks. 

‘An additional condition, .5F, employed the 
.5 distribution with informational feedback. 
The correct position, left or right, was an- 
nounced after each incorrect response, E 
making the correction over an intercom from 
the adjoining room. 

Subjects.—lhe Ss were 160 undergraduate 
volunteers, 40 being assigned to each distribu- 
tion and 40 to the .5F condition. Assignment 
was evenly divided between the orders of 
presentation for each condition, 10 of the 20 
Ss exposed to each order being males. Addi- 
tional data will be reported for a single, highly 
practiced S. 


Statistical analyses.—For Fig. 1 and 2 and 
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ARL LRL ?RL RLL LLL 
LLR RLR PLR LRR RRR 


STIMULUS SEQUENCE 


Fic. 1. Mean 
responses followin; 
of stimuli, 
subsequences. 


proportion of correct 
g different subsequences 
averaged for complementary 


also for several of the statistical analyses, un- 
weighted means were taken when averaging 
proportions for complementary stimulus se- 
quences (such as left, left, right; and right, 
right, left) so as not to reflect Ss’ position 
preferences. The major analyses of variance 
included Distribution, Order of Present: 
and Sex as independent variables. 
Order of Presentation nor Sex had si 
effects, except that males w 
more accurate; and the data are not reported 
separately for these two variables. All sig- 
nificant comparisons (b < .05) are reported. 


ation, 

Neither 
ignificant 
ere consistently 


RESULTS 


The major results of this experi- 
ment reduce to a single rule: accur 
is greater on trials for which the 
position of the flash has shifted, It 
is easier for S to discriminate the 
direction of the shift than to keep his 
bearings when the flash recurs in the 
same position. This greater accuracy 
for alternations than for repetitions js 
consistent with common sense, with 
the results of the preliminary studies 
using other stimulus materials, and 
with various models for judgment 
(Parducci, 1964). What is unex- 
pected is the degree to which the many 
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complex features of the data reduce 
to this simple difference. l nless 
informational feedback is provided, 
S's response is almost completely de- 
termined by the physical relationship 
between the presented stimulus and 
the immediately preceding stimulus. 
Contrary to the stimulus we 
postulated by contextual theories O 
judgment, S's response appears to be 
independent of even the second stim- 
ulus back. And yet, as will be shown 
in the sections which follow, the usual 


effects of stimulus distribution are 
obtained. 


Stimulus Dependencies 


The major experimental finding is 
illustrated in Fig. 1. Each point on 
the three curves represents the prapor- 
tion of correct responses, P (c), as 2 
function of sequences of the three 
immediately preceding stimuli (in- 
cluding the stimulus to which the 
response is given). Thus the leftmost 
point on each curve represents agii 
racy for an alternation trial precedes 
by one or more repetition trials (RRL 
indicating a sequence for which the 
order is right, right, left; its com- 
plement is LLR). s 

Consider first the curve for the -5 
condition. The two leftmost points, 
and also the third which is their 
weighted mean, have approximately 
equal values. Alternation trials are 
all the same: accuracy is unaffected 
by the length of the homogeneous 
stimulus run preceding the alternation. 

Accuracy on repetition trials 15 
shown by the two rightmost points On 
the same curve, Although accuracy 
is much lower for repetition trials, it 
is the same for all repetition trials, 
Le., for homogeneous runs of either 
two or else three or more stimuli. The 
response on a given trial is again 


; s 
independent of the second stimulu 
ack. 
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TABLE 1 
MEANS AND SDs or RESPONSE PROPORTIONS 
2 E! 5 E 
M SD M SD M SD M SD 
P(l) AS 10 53 13 252 09 52 .08 
P(e) ‘61 .09 .61 .09 67 .07 .68 .08 
P(c|L) 73 A3 99 A3 .10 10 70 12 
P(c|R) .59 11 .69 1S .65 11 .66 .10 
P(c|LL) .66 48 .56 d4 -63 A3 74 12 
P(c| LR) 66 16 2 AS E 12 .63 4 
P(c| RL) 15 ET .10 E Ni .10 .66 4 
P(c|RR) 56 B ,95 a21 .59 Bt: 68 10 


The same inferences can be made 
for the .2 and .8 conditions. Their 
combined curve, representing un- 
weighted means of their respective 
proportions, closely resembles the .5 
curve, Although the .2 + .8 curve is 
uniformly lower, the differences be- 
tween conditions are small relative to 
the effects of sequences within condi- 
tions. Sequences of more than three 
stimuli occur with insufficient. fre- 
quency to permit reliable assessment 
of sequential effects. However, no 
clear trends were noted with the 
longer series. For instance, P(c) for 
the combined .2 and .8 conditions 
does not drop below .55 for homo- 
geneous runs of up to length 5... 

The .5F curve suggests quite differ- 
ent inferences, the general effect of 
informational feedback being to in- 
crease the probability of the eon 
that would have been correct on me 
Preceding trial. Treatment i “a 
data from the .5F condition W1 P 
reserved for the end of the Resu ts 
Section, The immediate oet 
here is to demonstrate the statistica 
Significance of the major results o 
the conditions in which feedback was 
not given. i 
In the absence of informational feaa im 


accuracy on Trial # depen The 


1 è : and Sn. 
Mediate stimulus pally a S conditional 


means and SDs of these '? 


proportions, P(¢|SnSn-1), are presented in 
the bottom rows of Table 1. For example, the 
last row, labeled P(c|RR), gives the propor- 
tion of correct responses following repetitions 
of R. These four values were combined for 
each S to provide the raw scores for several 
analyses of variance. 

The first analysis assessed the difference 
between alternation and repetition trials, 
Each S’s score was the sum of P(c|LR) and 
P(c|RL), minus the sum of P(c|LL) and 
P(c| RR). The mean difference, .126, is 
highly significant, F (1, 114) = 141.38. Of 
the 120 Ss, 107 were more accurate on alterna- 
tion than on repetition trials. 

The second analysis tested position prefer- 
ence. The difference in accuracy for the two 
stimuli, L and R, was indicated by the sum of 
P(c|RL) and P(c|LL), minus the sum of 
P(c|LR) and P(c|RR). The mean for this 
measure of position preference, .045, 
significantly from zero, F (1, 114) = 5. 
indicating that Ss were more accurate on L. 
A third analysis, based on the sum of P(c|LL) 
and P(c|LR), minus the sum of P(c| RR) and 
P (c| RL), yielded no significant effects. There 
is thus no evidence of interaction between the 
position of the presented stimulus, L or R, 
and the stimulus relationship, alternation or 


repetition. 


Distribution effects.—Thus far, the 
only indication of distribution effects 
has been the difference in Fig. 1 be- 
tween the .2 + .8 and the .5 curves. 
Using the sum of each S's four condi- 
tional proportions as a measure of 
overall accuracy, small but significant 
differences were found for both Dis- 
tribution, F (2, 114) = 3.13, and Sex, 
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F (1,114) — 5.33, the better perform- 
ance coming from males and with the 
.5 condition. Of course, P (c) is 
greatest for the .5 condition because 
stimulus alternations are more fre- 
quent than in the .2 and .8 conditions. 
'The overall proportion of left re- 
sponses, P(/), provides another meas- 
ure of distribution effects. Alter- 
native models for the theory of 
adaptation level agree in predicting 
that P(l) would be independent of 
Distribution (Parducci, 1964). As 
shown in Table 1, this contrary- 
to-common-sense prediction is very 
nearly confirmed. The variation in 
P (l) is only 45 the variation in PL). 
However, the variation in P (I) is sta- 
tistically significant, F (2,108) = 3.23. 
The differences on these overall 
measures seem minor. The major 
effects of stimulus distributions are 
found when judgments are tabulated 
separately for each stimulus in accord- 
ance with the traditional procedure 
for assessing context effects in judg- 
ment. Asshown in Table 1, the mean 
proportion of correct responses to L, 
P(c|L), varies inversely with P, 
F (2, 117) — 14.34; and P (c| R) varies 
directly with P(R), F (2, 117) = 7.14. 
The directions of these Variations cor- 
respond to those obtained with cate- 
gory judgments when the 
distribution of the stimuli is varied 
(Parducci, 1963). In the present case, 
accuracy for each stimulus varies 
directly with the proportion of trials 
on which it was preceded by the other 
stimulus. Thus, the major distribu- 
tion effects follow directly from the 


frequency 


P(Jn = Jua) = P(LL)[PQ|L)P (LL) + P(r 
+ P(LR)EPQ|LD)PQ|LR) 
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proportions of correct responses to the 
different stimulus pairs. 


Stationarity.—There was some loss of ac- 
curacy over the series under all conditions. 
Although the mean drop in P(c) between the 
two halves was only .03 and the mean drop in 
P(I) only .02, individual Ss shifted dramat- 
ically. For one S, P(l) dropped from 88 for 
the first half to .21 for the second half. The 
mean absolute shift in P(I), irrespective of 
direction, was about four times as great as the 
mean algebraic shift for the various conditions. 

Chi squares were computed for each S to 
assess the stationarity of his first-order transi- 
tion matrix (Suppes & Atkinson, 1960, p. 56). 
Separate tabulation was made for the two 
halves of the sequence. Each of eight chi 
squares tested the significance of one of the 
shifts in response frequencies between halves. 
Each test applied to one of the two stimuli, 
conditional on one of the four preceding stim- 
ulus-response combinations. These eight 
values were summed for cach S. More than 
two-thirds of the Ss had significant overal 
chi-square values, indicating a general de- 
Parture from stationarity. Inspection of the 
transition matrices suggests that different Ss 
Shift in opposite directions, canceling each 
other out in the various group analyses. ne 
data of this section suggest that the individua 
deviations from stationarity are mainly shifts 
in response preferences which are not reflected 


in the systematic mean shifts for different 
conditions, 


Response Dependencies 


Responses tend to occur in homogeneous 
Tuns. Response dependency is indicated by 
the proportion of responses that repeat the 
immediately Preceding response. The means 
of the proportions of these first-order repeti- 
tions are .61, .58, and .64 for the 2, 5, and -8 
conditions. These means are greater than 
would be expected from the stimulus depend- 
encies, reported in the previous section for the 
two stimuli and four stimulus pairs. The 
expected proportion of response repetitions IS 


given by the following equation: 


ID)PC|LL) 


+ P(r|L)p 
+ PIREILPOIR PUR X CH DPCILR] 


+ PRRILP CRP | 


Substitution of the response proportions from 
Table 1 into Equation 1 yields the expected 


+ PIRPIR] 


RR) + P(|R)P(|RR)). C] 


Proportions, 


49, .48, and 49. With 90% of 
Ss exceedin 


g the overall expected value fof 
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their condition, it is clear that the preceding 
response is repeated much more frequently 
than would be predicted from the overall 
stimulus dependencies. 

The means of the proportions of second- 
order response repetitions, i.e., of trials for 
which the response repeats the second re- 
sponse back, are .62, .59, and .66. The mean 
difference favors the second-order dependency 
for all three conditions, F (1, 117) = 4.15. 
The expected value for second-order repeti- 
tions, computed using an analog of Equa- 
tion 1, is close to .50 for each of the three 
conditions. 

It is surprising that S’s response should be 
more dependent upon his second response 
back than upon his immediately preceding re- 
sponse. However, the differences between the 
first- and second-order dependencies approxi- 
mate the differences predicted from the stim- 
ulus dependencies. It can be shown that the 
predicted values are higher for the second- 
order dependencies whenever: P(c) > .5 for 
each stimulus and each stimulus sequence, 
and P(c) is greater for stimulus alternations 
than for stimulus repetitions. These condi- 
tions hold for the group data and also for the 
great majority of individual Ss in this expert" 
ment. Another factor is that any occasional 
tendency to alternate responses may decrease 
first-order repetitions but increase second- 
order repetitions. 

Much of the difference between the pre- 
dicted and obtained repetition proportions 
may depend upon individual differences in 
response preference. However, inspection O 
the individual scores indicates that marked 
response dependencies may be found even 
when there are no overall position preferences. 
For example, one 5 in the .5 condition had 
both first- and second-order repetition rates 
of approximately .70, even though his overall 
P (J) and the values expected from his stimulus 
dependencies were each close to .50. 

Analysis was also made of each S's response 
dependencies on stimulus runs of length 3 or 
more (LLL or RRR). Even for those runs 
on which either one OF the other of the pre- 
ceding responses had been incorrect, the e 
jority of Ss were more often epe rn ae 
wrong on Trial T. This indicates t " be 
continuing accuracy on runs ee 
duced to S's tendency to repeat his prev 


Tesponses. 


Practice Subject 

. The major experimental findings er 
licated using a single highly [icem gud 
One undergraduate female was hire 


455 


PRACTICED 
SUBJECT 


PROPORTION CORRECT 


——— 5 


*----248 


RRL LRL ?RL RLL LLL 
LLR RLR ?LR LRR RRR 


STIMULUS SEQUENCE 


Fic. 2. Mean proportion of correct re- 
sponses by practiced S following different 
subsequences of stimuli, averaged for com- 
plementary subsequences. 


for 62 separate 300-trial sessions. Two to 
four sessions were given each day, spread over 
a month, with at least 15 min. between ses- 
sions. The three stimulus conditions were 
given in randomized order; the .5 condition 
was used for about half the sessions, and the 
other half was evenly divided between the .2 
and .8 schedules. Informational feedback 
was never given. 

Her proportion of left responses dropped 
systematically over the first half of the month 
before stabilizing on a moderate preference 
for the right. This drop was associated with 
decreased accuracy on .8 and increased ac- 
curacy on .2. There was no systematic drift 
in accuracy for the .5 condition, the value of 
P (c) always falling between .60 and .72, with 
an overall mean of .673. This was virtually 
identical to P(c) for the single-session .5 
group. There was no clear evidence of inter- 
schedule sequential effects, €g., greater P(I) 
on .5 following .2 than following .8. Nor was 
there any systematic drift in her proportion 
of first-order response repetitions (calculated 
only for the .5 schedule) Although the 
overall proportion was only .507, this was 
significantly above the expected value (based 
on her overall stimulus dependencies). 

As shown in Fig. 2, the stimulus depend- 
encies for the practiced 5 correspond closely 
to the dependencies of single-session Ss. As 
with Fig. 1, Fig. 2 can be summarized by just 
two values: (a) proportion correct on alter- 
nation trials, and (b) proportion correct on 
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repetition trials. However, when Fig. 1 ana 
2 are compared, it is seen that the practice 

S shows a greater difference between alter- 
nation and repetition trials and does not show 
the lower accuracy for the .2 and .8 conditions. 

Chi-square tests were performed upon the 
practiced S’s P(c) in the .2 and 38 conditions 
for each of the four stimulus pairs, using the 
four proportions for the -5 schedule to obtain 
expected values. The shifts in proportions 
were statistically significant, x? (8) = 28.32, 
p < 001. 

As with single-session Ss, the practiced S 
was less likely to be correct on the more 
frequent stimulus: her P(c|L) values were 
-729, .635, .542 for the 2, .5, and .8 conditions. 
This replication of the important distribution 
effects is, of course, consistent with the 
dramatic difference in accuracy 
alternation and repetition trials. 
in conjunction with the stability 
accuracy through the month, 
from a highly practiced S suggest that the 
major findings of this experiment do not 


depend upon some initial set brought into the 
session by naive Ss, 


Feedback 


The data from the .5F condition 
were analyzed in the manner reported 
for the other conditions. As shown in 
Table 1, there was little difference be- 
tween the .5F and .5 values for either 
P(I) or P(c). Indeed, despite the in- 
formational feedback, there was actu- 
ally a mean drop of .023 in P (c) be- 
tween the halves of the Sequence for 
the .5F condition, F (1,39) = 5.48, 
Moreover, the .5F chi 
stationarity closely resen 
for the .5 condition. 

The failure of reinfor. 
P(c) may be clarified by comparison 
of the curves for the .5 and -SF condi- 
tions in Fig. 1. With feedback, the 
response is sharply affected by the 
second stimulus back. Accuracy in- 

creases on repetition trials With in. 
crease in the length of the homogene- 
ous stimulus run, i.e., with additiona] 
reinforcements for the response that 
is now correct. Accuracy on stimulus 
alternations is much lower when the 


between 
Considered 

in overall 
these results 


Squares for 
ibled those 


cement to raise 


alternation is preceded by a stimulus 
repetition, i.e., by two or more rein- 
forcements for the response which is 
now incorrect. The net result of the 
two reinforcement effects is to reverse 
the direction of the overall difference 
in accuracy between alternation and 
repetition trials, and on this com- 
Parison the .5 and .5F conditions 
were significantly different, F (1, 76) 
= 59.37. - 

The mean proportions of the first- 
and second-order response repetitions 
for .5F are .553 and .547, respectively. 
Both are lower than the corresponding 
values for the .5 condition, even 
though the expected values, .510 and 
517, are higher. These changes are 
consistent with the other effects of 
feedback since the proportions for 
response repetitions should approach 
the proportion of stimulus repetitions, 
in this case .5, insofar as Ss follow the 
preceding reinforcement. 


Discussion 
Adaptation-Level Theory 


Definition of the stimulus—Without 
informational feedback, the immediate 
stimulus Pair is the effective stimulus for 
judgment. The many possible stimulus 
dependencies reduce to the proportions 
of correct responses for the four pairs: 
LL, LR, RL, and RR. Prediction of the 
response is not improved by taking into 
account even the second stimulus back. 
The pairs operate as four discrete stimuli. 
f the results are analyzed with respect 
to these four rather than to just two 
stimulus values, all sequential effects 
isappear, 
This redefinition of the stimulus per 
ts interpretation of the data in teris 
frames of reference or adaptation 
levels Which would be impossible with 
Just two stimulus values, L and R. The 
difficulty would lie in the continuing 
difference between alternation and repeti- 
tion trials, ie. first-order stimulus de- 
Pendencies, in spite of the complete 


mi 
of 
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absence of even second-order depend- 
encies. This would require extreme 
weighting for recency, and the frame of 
reference or adaptation level would thus 
be completely determined by the pre- 
sented stimulus and the stimulus which 
immediately preceded it. But extreme 
weighting for recency would leave un- 
explained the differences in response to 
each stimulus pair for different stimulus 
distributions. 

As in previous work (Engel & Par- 
ducci, 1961), theoretical simplification 
may be obtained by employing a more 
generous definition of the stimulus. The 
Present proposal is to include the value 
of the immediately preceding stimulus 
and also a constant background value in 
the definition of the effective stimulus for 
judgment. The background value, B, is 
conceptualized as a reference point on 
the left-right dimension, based on cues 
Provided by the reference light and in- 
teroceptive stimulation. B is introduced 
to account for Ss’ accuracy on repetition 
trials, 

These physical values are combined so 
that Si, the effective stimulus on Trial 7, 
is defined as the difference, S, minus a 
weighted mean of B and Sy-1: 


Ss S. e oB a8], [23 


where a is an empirical constant repre- 
Senting the rclative influence of the back- 
ground and the preceding stimulus. 


This specification of the effective stimulus 
for judgment is a variation on the typical 
equation for adaptation level (Helson, 1959). 

lere, aB + (1 —a)Sn-1 is analogous to 
the perceptual adaptation level, or neutral 
Point on the left-right dimension. If a = 5, 
the successive differences between the effective 
Values of RL, LL, RR, and LR must all be 
equal; the larger the value of a, the smaller 
the difference between the effective values of 

L and LL, and also between RR and ZR. 

The analyses of variance reported for the 

Stimulus dependencies indicated that the 

ifferences in correct response to the four 
effective stimuli can be broken down to two 
major sources: the greater accuracy on 
alternation trials, and the greater accuracy 
9n L trials, The effects of these two sources 
May be considered additive since their inter- 
action is not significant. Almost 85% of the 
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variance between the 12 first-order contin- 
gencies, listed in the bottom four lines for the 
-2, .5, and .8 conditions in Table 1, can be 
accounted for by the following prediction 
equation: 


PlelSm S42) = M+ALP, [3 


where P(c|S,, Sn) is the proportion of 
correct responses to each of the four effective 
stimuli. The values of the Parameters, de- 
termined from the data in Table 1, were 
M = .656; A = +.063 for alternation trials, 
—.063 for repetition trials; and P = +.023 
for L trials, —.023 for R trials. 

The first constant, M, reflects the difficulty 
of discrimination and would be affected by 
environmental conditions such as the angle 
which L and R form with the reference light. 
The alternation constant, A, and the position- 
preference constant, P, are affected by the 
values of a and B in Equation 2. The 
redefinition of the stimulus thus permits this 
simple accounting for most of the variation in 
the stimulus dependencies for the .2, .5, and 
.8 conditions. 

For the feedback condition, however, this 
simple account does not suffice. With feed- 
back, there is increased accuracy on repetition 
trials but decreased accuracy on alternation 
trials. This seems reasonable since Ss would 
be expected to follow the preceding reinforce- 
ment when they cannot discriminate differ- 
ences between successive presentations. The 
proportions for the .5 and .5F conditions each 
differ by about .10. However, there are also 
second-order stimulus dependencies for the 
SF condition. These would require addi- 
tional theoretical analysis. 

Distribution effects.—Two small distribu- 
tion effects are found in accuracy for the 
stimulus pairs. First, the .2 + .8 curve in 
Fig. 1 is lower than the .5 curve. This differ- 
ence is hard to explain, but it is not found for 
the practiced S in Fig. 2. Second, the differ- 
ence between the proportions for the .2 and 
-8 conditions within each of the bottom four 
lines of Table 1 appears to be systematic. The 
LL and RL proportions are much lower for .8, 
but the LR value is lower for .2. Accuracy 
thus tends to be lower for pairs ending with 
the more frequently presented stimulus. 
Similar differences were found for the prac- 
ticed S. This would seem to be a typical 
distribution effect, appearing asa consequence 
of S’s tendency to equalize his use of the two 
responses (Parducci, 1963). Though much 
weaker than the major effects of distribution, 
these differences indicate that neither the 
present analysis, using stimulus pairs, nor 
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an adaptation-level analysis (Helson, 1959), 
using extreme weighting for recency, gives 2 
complete account of the distribution effects. 


The present analysis does account for 
the major effects of stimulus distribution. 
The conventional assessment of these 
effects would be made using the judg- 
ments of L and R, separately. Both 
P(c|L) and P(c|R) vary with stimulus 
distribution. The direction of the varia- 
tion is consistent with the contextual 
theories of judgment which predict 
category shifts toward the more fre- 
quently presented stimulus. However, 
much of the present distribution effect 
follows directly from the first-order stim- 
ulus dependencies. Since P(c|RL) is 
greater than P(c|LL), P(c|L) is neces- 
sarily greater for the .2 condition (which 
has four times as many RL as LL 
sequences) than for the .5 condition 
(where these two sequences have equal 
frequency) or for the .8 condition (which 


has four times as many LL as RL 
sequences). 


Decision Theory 


Signal detection—Although it is not 
clear that the theory of signal detection 
applies to the present task (Tanner, 
1956), the theory correctly describes the 
results of a similar experiment (Kinchla, 
1964). Kinchla found that the propor- 
tion of correct responses for either a left 
or right position of a threshold level, 
visual stimulus varied directly with the 
proportion of stimulus presentations in 
that position. However, P(c|L) varies 
inversely with P(L) in the present experi- 
ment, and P(c|R) varies inversely with 
P(R). The sequential effects were also 
different for Kinchla's detection task, 
His data were fitted by a learning model 
in which P(c) increases on stimulus 
repetitions, as in the present .5F condi- 
tion (Atkinson, 1962). Neither the 
overall shifts in detection rate nor the 
sequential effects attributed to learning 
are found with the present, nondetection 
discrimination, unless there is feedback, 

Perhaps the critical difference between 
the tasks is in the way the responses are 
defined. In the signal-detection task, the 


problem is to perceive the occurrence of 
asignal. In the present task, S always 
perceives the flash, his problem being to 
discriminate its location. This discrim- 
ination requires some kind of comparison, 
L being left with respect to R. The usual 
signal-detection task seems, at least 
formally, to be less dependent upon 
comparisons between alternative stimuli. 

Band-criterion model.—The traditional 
Thurstone model of the stimuli, postulat- 
ing overlapping discriminal distributions. 
can be elaborated to account for both the 
stimulus and the response dependencies 
in the present data. Assume that 
establishes a band criterion such that 
discriminal values to the left and right of 
the band are responded to as “left” an 
“right,” respectively. For each stimulus, 
let 8 denote the proportion of presenta- 
tions with discriminal values outside the 
band. Assume for the present analysis 
that the distribution overlap is negligible 
in the regions outside the criterion 
band so that S is correct on all such 
observations. 

When the discriminal value of the 
stimulus falls within the band, accuracy 
1s assumed to depend upon the preceding 
Stimulus and response. If the discriminal 
value of the preceding stimulus fell out- 
side the criterion band, S would dis- 
criminate the relationship between the 
two discriminal processes, and reverse his 
preceding response. If the value of the 
preceding stimulus fell within the CTI" 
terion band, S would repeat his preceding 
response with fixed probability, a. From 
the foregoing assumptions, it follows that 
P(c) for stimulus repetitions with the : 
schedule is given by B+ a(1— gy/ 
(1 + B) when the preceding response was 
correct, and by 1 — a(1 — 8) when the 
preceding response was incorrect. e 
Corresponding P(c) values for stimulus 
alternations are given by 1 — a(1 — 

(1 -F 8) and 8 +a(1 — 8). Correspond- 
ing expressions for these values were 
derived for the .2 and .8 schedules. 

Assuming that a and 8 are constant for 
all „three schedules, their values were 
estimated to be .83 and .26, respectively 

Y a least-squares approximation to the 
24 first-order proportions, P(c|Se-¥ 
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Jua Sna). The mean absolute deviation 
between fitted and obtained values is 
only .05, with 88% of the variance be- 
tween the 24 proportions being accounted 
for using only two empirical constants. 
The single parameter, 8, accounts for 
most of the difference in accuracy be- 
tween alternation and repetition trials 
and also for the overall level of accuracy. 
The response-repetition parameter, a, 
enables the model to account for the 
difference of .26 in P(c) between trials 
on which the stimulus corresponds to, 
and trials on which it differs from the 
preceding response. The model also cap- 
tures the additional distribution effects 
reflected in the relative drop in accuracy 
for stimulus pairs ending with the more 
frequently presented stimulus. The 
power of the band-criterion model lies in 
its ability to describe this difference and 
also the stimulus dependencies with just 
two empirical constants. i 
Whether the data from this experiment 
are described using the band-criterion 
model or with respect to stimulus pairs, 
most of the effects of stimulus distribu- 
tion can be deduced from first-order 
sequential effects. Sequential data have 
been virtually ignored by the various 
theories of judgment. The success of 
this demonstration should encourage 
analysis of sequential phenomena in 
experiments with additional stimulus 
and response alternatives. It would be 
gratifying to discover that the major 
context effects in more complex and 
perhaps better anchored situations could 


459 


also be reduced to the finer grain of the 
data. 
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University of Cal 


In 3 experiments, Ss learned the 1 


were then differently instructed for List 2 leart 
told to use the List 1 responses as aids 


ing Ss were told to unlearn List 1 r 
fere with List 2 learning. The d 
the rate of List 2 learning, 
items. 


recall, and reported many 
the Unlearning Ss, 


more ins 

In an influential study of transfer 
and retroaction effects in the A-B, 
A-C, and A-B, A-B’ paradigms (in 
which, respectively, two successive 
lists have dissimilar—B, C—or simi- 
lar—B, B’—responses, and identical 
stimuli), Barnesand Underwood (1959) 
proposed that Ss commonly unlearn 
List 1 responses when they are dis- 
similar to List 2 responses. However, 
when List 2 Tesponses turn out to be 
similar to List 1 responses, the List 1 
responses are used as mediators of the 
List 2 responses, and are therefore 
preserved from unlearning. As one 
goes from dissimilarity to similarity 
of responses, unlearning presumably 
gives way gradually to mediation, 
Evidence has recently been provided 
that some unlearning can occur in 
what appears to be an A-B, A-B’ 
paradigm (Dallett, 1964) : the experi- 
ments to be reported were designed 
to explore the role of mediation in the 
A-B, A-C paradigm. Specifically, an 
attempt was made to vary the relative 
predominance of mediation and un- 


1 At the time Exp. I was performed, K. M. 
Dallett was a Predoctoral Research Fellow, 
on Fellowship MF12-379 from the National 
Institute of Mental Health, United States 
Public Health Service. Experiments II and 
III were supported by Research Grant MH 
06589-01 from the same agency. 


ifornia, Los Angeles 
st list of an A-B, A-C paradigm, and 
ning. Mediation Ss were 
to List 2 learning, while Unlearn- 
ponses in order that they not inter- 
ifferential instructions did not affect 


es) 


but did affect the relative difficulties of List 2 
The Mediation Ss were slig 


htly better in subsequent List 1 
tances of interlist mediation than did 


learning by means of differential 
instructions, 

The three experiments to be re- 
ported are essentially similar in plan. 
After learning the first (A-B) list nO 
à fixed number of presentations, ^d 
were divided and differently instructed 
for List 2 learning. Half of the Ss (the 
Mediation or “M” condition) pé 
told that they would do best to seek 
Some connection, however bizarre, je 
tween the List 1 response they ha^ 
learned and the corresponding List 
response. The other Ss, in the us 
learning (U) condition were warned 
that List 1 responses would interfere 
with List 2 learning, and advised t? 
Suppress or unlearn the List 1 response 
as quickly as possible. All Ss were 
then trained on the second (A-C) lists 
and finally given an unpaced reca 
test of the MMFR variety to assess 


; ist 1 
the amount of unlearning of List 
responses, 


MzTHOD 


Materials—The lists were those vd a 
€ A-B, A-C (or Six) paradigm of Da Ive 
, and are given in that paper. ite 
laze trigrams of high-association value ee 
the stimuli, with adjectives as resp or ar 
umuli and responses were used in ie 
different Pairings (only three of which we gs 
used in Exp. I). The different paitini, 
differed in Pairing of stimulus and respon? 
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TABLE 1 


?, PRESENTATION RATE, AND DEGREE 


E LEARNING IN Exp. I-11II 


Degree of Learning 
Exp. E Rate Cond. 
List 1 List 2 

I Dallett 253. M,U 5 trials | 1 perfect trial 
M,U 15 trials 1 perfect trial 

II D'Andrea 2:2 M,U 12 trials 12 trials 

€ 12 trials — 

HT D'Andrea 3:19 M,U 9 trials | 9 trials 


Note.—Conditions: M = Mediation, U = Unlearning, C = Control (List 1 only). 


in all of these pairings the pairing of responses 
between lists was the same. Thus, referring 
to the lists given by Dallett (1962), aG 
always the List 2 response corresponding to 
(ie, paired with the same stimulus as) 
UNKIND on List 1, BELOVED corresponded to 


Was 


PETTY, ete. l'or each S, lists were presented 
in four different orders to reduce serial 
learning. 


Procedure.—Each S was seated before the 
memory drum and given conventional in- 
Structions for paired-associate learning. He 
Was not required to respond on the first 
Presentation, but was to attempt to anticipate 
each response on trials after the first. It was 
made clear that errors would not be penalized. 
The first list was then presented for a fixed 
number of trials, the exact number of trials 
and the rate of presentation differing from 
experiment to experiment (Table 1). Follow- 
ing List 1 learning, the drum was stopped and 
S was given one of two sets of instructions. 
For the M condition, the instructions were: 


Now I would like you to learn a second 
list. "The syllables will be the same but the 
Words are different. Try to use the words 
you have already learned to help you in 
learning this new set of words. If you can, 
try to find some way in which you can 
relate them, or some way in which the word 
on the first list helps you remember the 
word on the second list. For instance, if 
the first-list word was speedy and the new 
word is Jacy, you might want to think of a 
girl in a lacy dress driving a sports car—or 
You might just want to remember that lacy 
goes with the same syllable that. speedy 
went with before. Some people claim that 
the more ridiculous the association. you 
form between the words, the easier they 
àre to learn and remember. Exactly what 
you do is, of course, up to you. Try to 


learn this second list as quickly as possible. 
But try to make what you have already 
learned on the first list help you as much 
as possible. 


"he Ss in the Unlearning groups were told: 


Now I would like you to learn a second 
list. The syllables will be the same but the 
words are different. Try to forget the 
words you have already learned to these 
syllables: if you do not they will interfere 
with learning this second list. Most people 
who have trouble learning this second list 
say it is because words from the first list 
keep intruding; try to prevent this, Ex- 
actly what you do is, of course, up to you. 
Try to learn this second list as quickly as 
possible. But be on the lookout for inter- 
ference from the first list and try to sup- 
press or forget the first-list words as soon 
as you can. 


Two aspects of these instructions require 
comment. First, while Barnes and Under- 
wood (1959) emphasized mediation “chains” 
of the A-(B)-B' variety, the present instruc- 
tions also allow for "rings" in which the two 
responses may be separately related to the 
stimulus and to each other, and for a “branch” 
structure in which the B and C responses are 
learned as compatible responses to the same 
stimulus. For purposes of constructing the 
instructions, all of these were treated as 
mediators, or as alternatives to B-C in- 
compatibility. Secondly, the instructions in- 
cludean escape clause for 5; this was inserted 
to prevent an overconscientious S for whom 
the instructions might be disadvant 
from attempting to 


ageous, 
prevent learning which 
occurred spontaneously or by other means. 
In short, the instructions were designed to 
influence S's behavior but not to prescribe it, 
What S does as a result of such an instruc- 


TABLE 2 


S AND SDs OF PERFORMANCE IN List 2 
ING AND SUBSEQUENT List 1 RECALL 


List 2 List 1 Recall 
Exp. Cond. 

M SD M SD 
M-5 11.55| 4.07] 3.44| 3.70 
U-5 12.67 | 5.53 | 3.39 | 2.64 

I 
M-15 | 10.27] 5.11| 9.17 | 2.71 
U-15 | 11.16] 4.63| 7.11| 3.98 
M 79.35 | 16.28 | 7.25 | 3.54 
II U 72.95 | 23.25 | 5.70 | 2.83 
Control | — — |10.10| 1.78 
Ill M 61.60 | 16.65 | 9.05 | 2.52 
U 64.40 | 17.73 | 7.60 | 3.37 


,Note.—The List 2 learning measures are trials to 
itetion (Exp. I) or total correct anticipations (Exp, 
» HI. 


tional stimulus is a theoretical question re- 
quiring more than an assump 


tive answer. 
, The Second list was then presented to a 
criterion of one perfect trial (Exp. I) or for a 
fixed number of trials (Exp. II, III) at the 


same rate used for List 1, These details 
appear in Table 1, 

Immediately after List 
given an MMER test of rı 
were handed a dittoed list of the stimulus 
syllables, each with two vertically arrayed 
blank spaces on which S was asked to enter 
the two responses. No time limit was placed 
on this test of recall, The syllables were 
presented in three different orders to different 
Ss in Exp. I, with five different orders being 
used in Exp. II and III, 

Finally, each S was hande 

taining all 12 stimuli each Paired with the 
appropriate set of two Tesponses. On this 
sheet Ss were asked to indicate any “associa- 
tions, mnemonic devices or ‘tricks’ ” which 
had been used in learning any of the items, 
Most Ss seemed to have no difficulty in filling 
this out, and no attempt was made to force 
answers. 

Subjects and interexperimental differences, — 
Most of the differences between experiments 
are outlined in Table 1. The Ss in all three 
experiments were students in introductory 
psychology classes who participated as part 
of a class requirement, and were assigned to 
conditions in rotation prior to their appear- 
ance in the laboratory. Experiment I was 
carried out at the University of California, 
Berkeley, in the summer of 1961 using a 


2 learning, Ss were 
ecall, in which they 


d a sheet con- 
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Phipps and Bird memory drum. In Exp. à 
there were 18 Ss in each of the four groups, 
most, but not all, being naive with respect to 
learning in the laboratory. In Exp. I it was 
necessary to replace two Ss who were ae 
trying to anticipate overtly on carly trials, 
and a third S was discarded for E crror. 

Experiments II and III were carried out 
at the University of California, Los Angeles, 
in 1962 and 1963, using a Stowe .(459B) 
memory drum. Experiment II contained, m 
addition to the M and U conditions, a contro 
group which learned the first list, was given 
the Block Design and Digit-Symbol subtests 
of the WAIS for the time passed by ue 
groups in List 2 learning, and was tested for 
List 1 recall. In Exp. II and III each group 
contained 20 Ss, none of whom had prior 
experience in learning experiments. In Exp. 
II one S had to be discarded for E error. 


RESULTS 


List 1 learning—Analyses of vari- 
ance were used to compare the M and 
U conditions in each i apis 
separately. The groups were treated 
identically during first-list m 
and in none of the experiments A 
significant differences appear, the 
ratios for Exp. I and IlI being less 
than 1.00, and for Exp. HI, F (2, 57) 
= 1243,p 7.05. Intralist and Other 
errors were also examined. In List 1 
learning as in all of the analyses to be 
reported, examination of overt errors 
failed to yield findings of significance, 
and will not be further discussed. i 

List 2 learning —Analyses of van 
ance comparing trials to criterion : 
the M and U conditions of Exp. I an 
total correct anticipations in Exp- ! 
and III yielded, respectively, £ (1, 68) 
= 6n P (1, 38) = 1420, F (1, 38) 
= .294, all p’s > .05. The means an 
SDs of these List 2 performance meas- 
ures appear in Table 2, where it iS 
apparent that even the direction O 
nonsignificant differences is not COn- 
sistent. Differentia] instructions 4° 
not appear to have influenced overa 
Performance level in List 2 learning- 

Owever, this does not mean that 


MEDIATION AND UNLEARNING INSTRUCTIONS 


learning proceeded identically in M 
and U conditions. Closer examination 
of the data indicates that Ss given 
mediation instructions tended to do 
best on List 2 items corresponding to 
well-learned List 1 items, while Ss 
given unlearning instructions show no 
such tendency. In order to make 
these comparisons, the 12 itéms on 
each S's first list, and the 12 items on 
his second list, were separately rank 
ordered according to the number of 
correct anticipations each item had 
obtained in learning. The two sets of 
ranks for each S were used to obtain 
the sum of squared differences be- 
tween the ranks for corresponding 
List 1 and List 2 items. One such 
score was obtained for each S, ex- 
pressing the relation between the 
difficulty or degree of learning of cor- 
responding items on the two lists. 
This score (XD?) is an index of rela- 
tionship which can be transformed 
into the Spearman rank-order coeffi- 
cient of correlation. An analysis of 
variance of these scores was carried 
out, using M-U as one variable and 
Comparisons as the other variable. 
Experiment I furnished 5-trial and 15- 
trial comparisons while Exp. II and 
III furnished one M-U comparison 
each. The analysis revealed a sig- 
nificant M-U difference, F (1, 145) 
= 9,3441, p < .001, no difference 
among comparison means, F (3, 145) 
= 1.003, and a significant interaction, 
F (3,145) = 4.397, p < 01. Individ- 
ual £ tests were used to test M-U 
differences in each of the four com- 
Parisons. The results of these t tests, 
together with the correlation coethi- 
cients corresponding to the mean zD? 
for each group, appear in Table 3. It 
will be seen that except for the 5-trial 
groups of Exp. I, the M groups show 
Consistently higher mean correlations 
that the U groups, the difference being 
Statistically significant in two of the 
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TABLE 3 


RANK-ORDER CORRELATIONS OF List | 
AND List 2 [ITEM DirricULTY 


Exp. M U Hi 
I-5 444 .266 1.14 
I-15 .299 —.068 4.19** 
II .238 O74 1.62 
Ill 324 A17 2.46* 


Note.— The ¢ ratios for Exp. I have df = 34, those 

for Exp. II and III, df — 38. g 
*p <05. 
**$ <.01. 


four comparisons. The conditions for 
which this effect is not significant are 
the 5-trial conditions of Exp. I, in 
which Ss have not learned enough of 
List 1 to provide stable mediators or 
stable indexes of relative item difficul- 
ties, and the conditions of Exp. II 
in which the presentation rate was 
slightly faster than the rates used 
in Exp. I and II. It is worth not- 
ing that there is no evidence to 
suggest that usually, in the A-B, A-C 
paradigm, the best-learned List 2 
items should correspond to the most 
poorly learned List 1 items, although a 
simple response-competition assump- 
tion might lead to such a prediction. 
On the contrary, the data of Under- 
wood (1951) indicate a positive cor- 
relation, and there are reasons (e.g., 
common stimuli) for expecting this to 
be generally true. While the differ- 
ences in these interlist item-difficulty 
correlations are not large, they are 
consistent with the assumption that 
Ss were responding differentially to 
the different instructions which pre- 
ceded List 2 learning; under such an 
assumption Mediation Ss should focus 
first on items for which well-learned 
List 1 mediators are available, while 
Unlearning Ss should ideally concen- 
trate on items for which there is least 
unlearning to be done. 
Recall.—Second-list recall was vir- 
tually perfect in every condition, and 
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TABLE 4 


CLASSIFICATION SCHEME USED IN TABULATING MNEMONICS 


Category Definition 


Example 


S Mentions only the stimulus. 


n 
DEP-HEAVY-COMPLETE: "DEP — deef. 


List 2 response (S-2). 


Separate associations linking stimulus 
and List 1 response (S-1) or stimulus and 


DEP-HEAVY-COMPLETE : "DEP = deep, 
which suggested HEAVY: On List 2, COM- 
PLETEly DEPendent" “DEP-HEAVY both 
have E, while DEP-COMPLETE share P. 


appropriate to both responses. 


Single mnemonic incorporating the stim- 
ulus and both responses, or a single stim- 
ulus interpretation which is described as 


DEP-HEAVY-COMPLE ‘DEP = deep, i 
and things which are HEAVY and COM- 
PLETE are likely to sink, suggesting deep. 


tion to List 2 response, 
S-1, 1-2 chain. 


List 1 response linked to List 2 response 
2 without explicit mention of stimulus rela- 


Can be part of 


“Things which are 
, 
so be HEAVY.’ 


will not be discussed. The Means and 
SDs for List 1 recall appear in Table 2. 
The Exp. II Control 
gests that retroactive 
occurred, and is of n 


condition sug- 
inhibition has 
o further im- 


portance. The differences for the 
5-trial groups in Exp I are obviously 
insignificant, 


Analyses of the M-U 
differences in the 15-trial groups of 


Exp. I, and Exp. II and lII yield 
differences which are not statistically 
significant when each experiment is 
considered separately: for Exp. I, 
F (1,34) = 2.996; for Exp. IT, F (1,38) 
= 2.342; for Exp. Ill, F (1, 38) 
= 2.382. If, however, all of the 
M -U differences (including the 5-trial 
groups of Exp. 1) are included in a 
single analysis, it becomes clear that 
the M groups recalled slightly more 
(a mean of 7.28 items) than the U 
groups (5.99 items) with F (1, 144) 
= 6.186, p < .05. Tabulation of 
List 1 responses as a function of the 
number of times they were correctly 
anticipated in learning suggests that 
the superiority of the M groups occurs 
for items at an intermediate level of 
difficulty (or degree of learning), and 
not for the very strongest or the very 
weakest items. The necessity of a 


combined analysis to achieve signifi- 
cance at the .05 level indicates that 
the overall differences are quite small 
in relation to intragroup variability. 
Reported associative aids.—Vollow- 
ing the test of recall, Ss reported 
associative aids used in learning. 
While such retrospective reports need 
have nothing essential to do with the 
learning process, they still represent 
a facet of verbal learning which needs 
to be better understood. The reports 
obtained in each experiment were 
classified, using the categories out- 
lined in Table 4, These categories 
Were designed after the data from 
exp. I had been examined, and werc 
intended to Capture all reports of 
potential mediators which linked List 
1 items with List 2 items. Operating 
on the assumption that the number 
of such mediators was the essential 
datum of interest, in the final tabula- 
tion dubious cases of mediation were 
included as instances of mediation. 
Teating dubious cases as instances of 
Nonmediation reduces the number of 


Teported mediators but changes no 
conclusions, Each of the authors 


Scored Exp. IT and TI 


; I independently; 
finding the same res 


ults in every in- 
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TABLE $ 
MEAN NUMBER OF MNEMONICS REPORTED AFTER LEARNING 
Category 
Exp. Cond. 
S ES! S2 S-1-2 449 
M-5 22 2.50 5.83 1.55 55 
U-5 55 | 2.50 5.05 1.16 22 
I 
M-15 At 3.22 3.00 2.50 3.38 
U-15 33 5.16 5:95 2.17 A 
M .05 2.40 2.75 1.55 2.00 
H U 35 3.40 4.15 (65 10 
Control A5 3.40 
5 2.60 1.75 3.35 3.00 
II M 70 3.20 4.25 1.00 “10 


Note.—Sce Table 4 for explanation of scoring categories, 


Stance but differing in the number of 
Mediators found; these differences 
could be resolved after discussion of 
the criteria each had adopted. . (The 
tabulation presented in Table 5 is that 
of Dallett.) In examining Table 5, 
the categories of interest are S-1-2, 
and 1-2, which include reports of 
associations which link or incorporate 
both List 1 and List 2 responses. The 
S-1 and S-2 categories, on the other 
hand, represent independent mne- 
monics. As an example, consider 
Cond. M-5 of Exp. I. An average of 
22 mnemonics per S involved only 
the stimulus, and 2.50 4- 5.83 were 
Single-list mnemonics linking stimulus 
and response, for a total of 8.55 out 
of 24 possible single-item mnemonics 
per S. The mediating responses 
included 1.55 in the S-1-2 category 
(involving both S-1 and S-2), and .55 
in the 1-2 category which may have 
been relevant only to List 2, or which 
may have accompanied an S-1. 
Hence, each S gave, on the average, 
Mnemonics for 12.20 out of his 24 
items, a rate close to 50%. The one- 
list Control of Exp. II suggests that 

ist 2 learning does not seriously 
distort the rate of reporting List 1 


mnemonics, which varies from about 
25% to about 45%. Overall rates 
undoubtedly depend upon the ma- 
terials, instructions, and other condi- 
tions of testing, and only relative 
rates are of interest here. It will be 
seen that mediating responses are 
more frequently reported in the M 
groups in each of the eight possible 
comparisons, with a compensatory 
surplus of independent mnemonics in 
the U groups in six of the eight com- 
parisons involving S-1 and S-2 and in 
each of the four comparisons involving 
S alone. 

As the examples in Table 4 might 
suggest, many of these reports have a 
flavor of rationalization about them, 
and may well have arisen in an 
attempt to "explain" (after the fact) 
why a given item was an easy one— 
or they may represent .S's attempt to 
indicate that he is an "intelligent" 
rather than a "rote" learner. At the 
same time, the data are consistent 
with the assumption that Ss in the 
two groups were responding to the 
differential instructions, and that Ss in 
the M groups were especially likely to 
be aware of potential relationships 
linking their first and second lists. 


466 


Discussion 


While Ss’ apparent compliance with 
instructions to mediate or to unlearn did 
not affect the degree of List 2 learning 
at all, it did have a small effect on List 1 
recall. This effect might have arisen in 
two ways: one way is through the differ- 
ence in relations between List 1 and 
List 2 item strengths, and the other way 
is through the assumed Occurrence of 
different numbers of mediating responses 
in the two experimental conditions. 
the item-strength explanation 

sumed that List 1 items weak 
equal to their corresponding L 
are especially susceptible to in 
in the M groups the strong L; 
tended to correspond to strong List 1 


items, while in the U groups there is no 
such relationship. Hence, 


In 
y it is as- 
er than or 
ist 2 items 
terference: 
ist 2 items 


correlations are 


certain List 1 responses, 
practice is reflected in gre 
to interference. The ite 
those weak enough to 
extra practice, and yet 
to be recalled and used 
List 2 learning. 

The results are also relevant to two 
recent experiments which have shown 
that retroaction can be affected by in- 


ater resistance 
ms affected are 
Profit from the 

Strong enough 
as mediators in 
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structions given prior to List 2 learning 
(Postman & Stark, 1962; Schwartz, 
1963). In each of these experiments, the 
instructions related to the forthcoming 
test of List 1 recall, Postman and Stark 
(1962) finding that Ss warned of List 1 
recall did better than those not warned, 
and Schwartz (1963) finding that Ss 
warned that List 2 would make List 1 
recall harder did better than Ss warned 
that List 2 would make List 1 recall 
easier. The present experiments differ 
in that no reference was made to List 1 
recall, and also differed in that the test 
of List 1 recall was unpaced. Otherwise, 
the present results agree with those of 
Postman and Stark (1962) and Schwartz 
(1963); a difference in recall appeared 
with no difference in overall List 2 
performance. The mediation hypothesis 
offers one mechanism through which such 
effects can be explained. 


REFERENCES 


Barnes, J. M., & UxpERwoop, B. J. “Fate 
of first-list associations in transfer theory- 
J. exp. Psychol., 1959, 58, 97-105. <a 

DALLErT, K. M. The role of response simt- 
larity in proactive inhibition, J. exp. 
Psychol., 1962, 64, 364-372, ; 

DALLETT, K. M. Proactive and retroactive 
inhibition in the A-B, A-B’ paradigm. 
J. exp. Psychol., 1964, 68, 190-200. " 

Posrwaw, L, & STARK, K. Retroactive 
inhibition as a function of set during the 
interpolated task. J. verbal Learn. verbal 
Behav., 1962, 1, 304-311. 

Senwanrz, H, A. 
set and resp 
interference, 
127-132, 


Unverwoon, B. J. Associative transfer in 
verbal learning as a function of response 
similarity and degree of first-list learning. 
J. exp, Psychol., 1951, 42, 44-53. 


Influence of instructional 
onse frequency on Egi 
J. exp. Psychol., 1963, 66, 


(Received February 20, 1964) 


Journal of Experimental Psychology 
1965, Vol. 69, No. 5, 467-475 


EFFECTS OF DELAYED AUDITORY FEEDBACK ON MORSE 


TRANSMISSION BY SKILLED OPERATORS 
AUBREY J. YATES?! 


University of Western Australia 


3 experiments were carried out on the effects of delayed auditory feed- 
back (DAF) on the transmission of letters by 6 skilled Morse code 
operators. In the Ist experiment, random sequences of 8 letters were 
transmitted under 3 conditions (each letter transmitted separately, 
continuously at preferred rate, or continuously as fast as possible) at a 
constant delay time of 180 msec. In the 2nd experiment, the same 
sequences were transmitted as fast as possible at delays varying from 30 
to 300 msec. as well as under no delay. In the 3rd experiment, mean- 
ingful material was transmitted as fast as possible under a delay of 180 
msec. The results showed that DAF produces a great increase in the 
number of errors made; that the errors almost always involve an addi- 
tional symbol or symbols; and that letters involving 3 or 4 symbols 


produce many more errors than letters involving 1 or 2 symbols. 


A considerable body of research has 
now accumulated concerning the ef- 
fects of delayed auditory feedback 
(DAF) on speech (Yates, 1963a, 
1963b). It is also known, however, 
that DAF may affect other kinds of 
behavior in similar ways. The dis- 
turbing effect of DAF on rhythmic 
tapping was noted by Lee (1951) in 
his original study and was subse- 
quently confirmed by Chase, Harvey, 
Standfast, Rapin, and Sutton (1959; 
Chase, Rapin, Gilden, Sutton, & Guil- 
foyle, 1961). In both these studies 
it was noted that, under DAF, the key 
was tapped harder, held down longer, 
more taps were given than asked for, 
and the pauses between taps were 
lengthened. More recent studies have 
extended these findings. Chase, Sut- 
ton, Fowler, Fay, and Ruhm (1961) 
found that disturbance in tapping 
thythm was produced by a feedback 
level as low as 10 db. above sensation 
level. Chase, Rapin, Gilden, Sutton, 
and Guilfoyle (1961) showed that 
rate and intensity of tapping were 
affected by decreased auditory feed- 


! Now at the University of New England, 
Armidale, N. S. W. 


back (where S had been trained to tap 
at a specific rate, and with a specific 
amount of pressure). DAF was found 
to produce much more disturbance in 
a complex tapping task than in the 
specific rate/pressure task. 

It is well known that the optimal 
delay (at high feedback intensity 
levels) for producing maximal speech 
disturbance is about 180 msec. How- 
ever, very little is known at present 
concerning the explanation of this 
phenomenon, except that the time 
relationships between the output and 
its associated feedback are probably 
very important. Unfortunately, the 
fundamental speech units involved in 
monitoring are not known and hence 
the analysis of speech disturbance is 
not very suitable for the elucidation of 
the basic time relationships between 
sensory reception, central processes, 
and motor behavior under conditions 
of DAF. The systematic investiga- 
tion of complex tapping performance 
under normal and delay conditions 
may be more suited for this purpose, 
since the units involved (dots and 
dashes) are discrete and readily quan- 
tifiable in terms of errors and trans- 
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TABLE 1 


LETTERS AND SYMBOLS USED 
IN Exr. I anp II 


Two Three Four 
EUN Symbols Symbols Symbols 
ms [UM x TW self aee 
Tal N € [2 me]B c 


mission times. The experiments re- 
ported here have attempted to clarify 
these relationships by making use in 
the first two experiments of a re- 
stricted sample of the Morse code 
alphabet. In the first experiment, 
skilled operators transmitted letters 
involving one, two, three, or four 
symbols under a delay of 180 msec. at 
three rates of sending. In the second 
experiment, the effects of varying de- 
lay intervals were investigated. In 
the third experiment, the effect of 
DAF on continuous transmission of 
meaningful material was studied. The 
basic hypothesis was that the effect of 
delay would be greater, the longer the 
unit being transmitted, on the as- 
sumption that with very short units, 
the letter will have been transmitted 
before the delayed feedback is re- 
ceived, whereas with longer units, the 
message will be still in process of 


transmission when the delayed feed- 
back commences. 


Pitor STUDY 


In a series of preliminary experi- 
ments, utilizing the materials de- 
scribed below, it was verified that Ss 
did not make errors when transmitting 
under normal (no-delay) conditions 
while wearing headphones; nor did 
they make errors when masking white 
noise was presented through the head- 
phones while they were transmitting, 
so that they could not hear the sound 
of the key. It was concluded that the 
absence of auditory feedback did not 
affect the accuracy of transmission. 
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EXPERIMENT | 
Method 


Subjects—The Ss were six employees of 
The Postmaster General's department. All 
were highly skilled operators, with many 
years of experience, and were accustomed to 
receiving and transmitting in noisy conditions 
over long distances. : 

Apparatus—The S transmitted with a 
standard Morse key, the output of which, 
under the delay condition, was fed into a 
delayed feedback machine and returned to 
S's ears through headphones after a delay of 
180 msec. The amplified playback intensity 
at the headphones was set at 100 db., as 
indicated by a Dawe sound-level meter, Type 
No. 1400E, which was acoustically coupled 
to one ear phone of the headphone set. The 
coupling was by a sound column of } in. 
between the microphone and the headphone. 

Materials—The eight letters shown in 
Table 1 were used. Tt will be noted that 
there are four groups of letters, with two 
Tepresentatives each of letters involving one» 
two, three, and four symbols. ‘Twelve lists 
Were constructed, each containing 32 letters, 
ie. each letter appeared four times. The 
order of the letters in cach list was determined 
from tables of random numbers. Each list 
was printed on a separate sheet, with eight 
rows to each sheet, Thus, a different list was 
used with each of the 12 conditions describe 
below, 

Experimental design.—Three transmitting 
conditions were used. In the first condition, 
S transmitted each letter separately, i.e., With 
a slight pause between each letter. In the 
second condition, 5 transmitted continuously 
at his own preferred rate. In the thir 
condition, $ transmitted continuously as fast 
as possible. Each of the 3 conditions Was 
replicated four times, the sequence of the 1 
Conditions being randomly determined for 


each s. ach 5 was tested individually. " 
Scoring.—An electrocardiograph pen T€ 
cord 


er Was used to produce a permanent 
record of S's performance, No difficulties 
Were encountered in scoring the records for 
TTOIS and transmission time for individua 
letters was readily calculated. 


Results 


Table 2 shows the distribution of 
errors, summed over Ss, for the three 
transmission rates, by letter. Several 
Striking facts are apparent. Errors 
volving letters of three or four 
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symbols are nearly 15 times as com- 
mon as errors involving letters of one 
or two symbols. Errors for four- 
symbol letters are twice as frequent as 
errors for three-symbol letters. For 
the three- and four-symbol letters, the 
errors for D and B (dash followed by 
dots) are twice as frequent as the 
errors for W and J (dot followed by 
dashes), respectively. Finally, an 
equal number of errors is produced in 
all three transmitting conditions, the 
pattern of errors for the eight letters 
being very similar for each condition. 


Discussion 


As noted in earlier experiments on less 
complex tapping tasks, it is clear that 
DAF has a strikingly disruptive effect 
on the accuracy of transmission of letters 
of the Morse code by highly skilled 
Operators, who are accustomed to trans- 
mitting under difficult conditions. Fur- 
ther, as hypothesized, the effect of DAF 
is negligible provided the letters being 
transmitted involve only one or two 
Symbols. A very significant increase in 
errors occurs, however, when the letters 
being transmitted involve three or four 


TABLE 2 


ERROR Scores ron Eactt LETTER UNDER 
THREE TRANSMITTING CONDITIONS 


Exp. I 
Transmitting Cond. 
is Fast Percentage 

Fast à: a 

Separately| Perret | possible | Error 
E 1 0 1 1 
T 0 1 0 0 
1 0 2 
N 3 n 2 4 
Ww T 10 11 
D H 25 19 23 
J 17 22 25 22 
B 37 39 40 40 
Total) o8 | 103 | 97 13 


Note.—For each cell in the table, He een 
Possible error score (summed over Ss) was 90. 
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TABLE 3 


= (ix MsrC.) For EIGHT 
2 NDER THREE CONDITIONS 
(180-Msrc. DeLay): Exp. I 


‘TRANSMISSION 


Transmission Cond. 
Letters s 

Separately | Own Rate | Fast as 

UN 2 Possible 
E 050 046 044 
T 268 239 197 
A 374 336 286 
N 317 291 267 
y 556 520 477 
D 483 446 420 
J 824 711 668 
B 631 575 559 


symbols. Furthermore, the severity of 
the effect of delay on the letters involving 
longer symbols appears to depend on 
whether the symbol consists of a dot 
followed by dashes or the reverse, with 
the latter being more susceptible to error. 
Examination of the individual records 
showed that this effect was clearly 
present in five Ss for the letters J and B, 
and in three Ss for W and D (the sixth S 
was unaffected by DAF). 

These results, therefore, may appear 
at first sight to provide some support for 
the hypothesis that the transmission 
time of one- and two-symbol letters is so 
short that transmission of the signal has 
been completed before the feedback 
arrives back at the cortex and hence 
cannot be disrupted by it. (It would 
appear to be equivalent, therefore, to 
transmitting in the absence of feedback— 
as we have seen, skilled Ss are capable of 
doing this without error.) 

A more direct check of the hypothesis 
may be obtained, however, by calculating 
the transmission times for the eight 
letters used in this experiment. If the 
hypothesis is correct, then the transmis- 
sion time for letters involving one and 
two symbols should be less than the delay 
time used in this experiment, i.e., 180 
msec., while the transmission time for 
three- and four-symbol letters should be 
greater. The objective pen tracings of 
S's performance enabled a check on 
transmission times to be made, since the 
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paper traveled at a constant speed. The 
transmission time was taken to start at 
the point where the pen began its up- 
stroke on the first symbol and to ter- 
minate at the point where the pen began 
its downstroke on the last symbol of a 
letter. The transmission time for all 
letters correctly transmitted was calculated 
in this way with the results shown in 
Table 3 for all Ss combined, a total of 
2,007 responses being scored. 

It will be noted that the instructions 
to transmit separately, at preferred rate, 
or as fast as possible, were successfully 
communicated to S, since for each letter 
there is a consistent increase in trans- 
mission speed over the three conditions. 
However, it is clear that the results offer 
no support whatever for the hypothesis, 
since all letters except E took longer than 
the delay time (180 msec.) to transmit. 
Since few errors were made on the letters 
T, A, and N, it follows that overlap 
between transmission time and delay 
time is not a sufficient condition for the 


disruption of performance in this par- 
ticular task, 


EXPERIMENT lI 


It is well known that there is a 
differential effect of DAF on speech at 
high intensities of feedback (Butler 
& Galloway, 1957). Black (1951) 
showed that there is a discrete step- 
wise increment in the effect at a delay 
of 60 msec., that between 60 and 180 


msec. there is a linear relationship 
between increased delay and increased 
disturbance of Speech, and that be- 


tween 180 and 300 msec, delay the 
effect remains fairly constant. With 
delays longer than 300 msec., the 
effect gradually lessens and Speech 
returns to approximately normal when 
the delay becomes greater than 500 
msec. Little is known, however, of 
the effects of differential delay on 
tapping, although a recent study by 
Rapin, Costa, Mandel, and Fromowitz 
(1963) with children which used de- 
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lays from 0 to 1,000 msec. in 100-msec. 
Steps suggests that delays up to 
1,000 msec. produce roughly the same 
amount of disturbance. Nothing 1s 
known of the differential effects of 
delay on skilled tapping performance. 
Experiment II was designed to throw 
light on this problem. 


Method 


The Ss, apparatus, and materials were ex 
same as those used in the first experiment, 


this experiment being carried out immediately 
after Exp. I. 


Experimental design.—The 12 sets of 32 
letters were randomly paired with 12 delay 
times, the randomization being carried out 
afresh for each S. Ten different delays were 
used (30-300 msec, by 30-msec. increments). 
One of the sets was transmitted under T 
delay prior to the delay conditions, while 
another set was also transmitted under e 
delay after the delay conditions. All sets o 
letters were transmitted under instructions 


F it 
to work as fast as possible and to transmi 
continuously, 


Results 


The number of errors for each letter 
under each delay is shown in Table 4. 
There appears to be a discrete incre- 
ment in the number of errors as the 
delay time increases from 90 to 120 
msec. Delay times greater than 9 
msec. produce roughly the same 
number of errors up to the limit of 300 
msec. used in this experiment. These 
results agree with those of Rapin et al, 
(1963) as far as they go. Letters 10- 
volving three and four symbols pro- 
duce many more errors than letters 
involving one or two symbols, being 
approximately six times as frequent- 

Owever, there is no suggestion e 
this experiment that Letters W and J 
Produce fewer errors than D and B; 
if anything, there is a reverse tend" 
ney. In Exp. I, the delay used W25 
180 msec. Reference to the delay 
time of 180 msec. in Table 4 confirm? 
the absence of this differential effect- 


DELAYED FEEDBACK AND MORSE TRANSMISSION 471 
TABLE 4 
Delay Time (in Msec.) 
Letters Percentage 
oo | 30 | 60 | 90 | 120 | 150 | 180 | 210 | 240 | 270 | 300 | 00 | Total Een 

E 0.0|0|0/0/|0|0/|0/|0/|0/,0/| 0 0 0 

I (i ee a PT) 0| 0| Oo} 0| 1] Oy oO} 9 2 1 

A 61€ | Bile 1| 6j 6] 1l] GT] t 1 1 7 2 

N 0|0/|0/,0 4| oO | t 1/5] 4) 3] 0| 18 6 

W | b$]3 ol Si 9 £| &) eh apa 39 14 

D 06|90[90/|60 $! $1 5 i 7! mI ea 27 

J 3|o|4]|90 a 35 8] 6| 5| 9| 6] 0 51 18 

B BILIIJS]t 5| 3] 9 *&|s]| oy SER 45 16 
Total 3 | 3 | 9 | 3 | 20 | 16 | 29 | 20 | 31 | 31 | 23 | 1 | 189 7 


Note.—For each cell in the table, the maximum possible error score (summed over Ss) was 24. 


Discussion 


While these results in general confirm 
the findings of Rapin et al. (1963) regard- 
ing the effects of different delays on 
tapping, there are two features of the 
results which require comment. First, 
the absence of a differential effect for 
three- and four-symbol letters, according 
to whether they commence with a dot 
and end with dashes or the reverse, is not 
readily explained. There is no doubt 
that, allowing for the absence of errors at 
very short delay times, the number of 
errors produced is proportionately as 
great as in Exp. I. (Thus, at a delay of 
180 msec., 29 errors resulted from one 
quarter the number of symbols used in 
Exp. I. This gives an adjusted error 
total of 116 which compares quite well 
with the 97 errors produced under the 
equivalent condition in Exp. 1.) 

Second, the relative lack of errors at 
the short delays confirms the finding in 
Exp. I that overlap of the transmitted 
and delayed signals is not a sufficient 
Condition for the disruption of perform- 
ance, If the one- and two-syllable letters 
had been transmitted faster than the 
delay time in Exp. I then the shorter 
delays used in Exp. H should have 
disrupted correct transmission of these 
letters. However, aside altogether from 


the results obtained in Exp. I, it is clear 
from Table 4 that the shorter delay times 
do not have any differential effect on 
transmission of the one- and two-syllable 
letters. This finding is also confirmed by 
an analysis of transmission time scores 
for correctly transmitted letters, as 
shown in Table 5. There are two in- 
teresting features of this table, however. 
First, it would appear that the frequent 
occurrence of incorrectly transmitted 
letters in the delay condition has no effect 
on the speed of sending of adjacent cor- 
rectly transmitted letters since the in- 
dividual letter and overall average times 
for the two no-delay conditions are not 
markedly different from those for the 
delay times. Second, the transmission 
times for each individual letter remain 
remarkably constant over the different 
delays, that is, correctly transmitted 
letters are sent at roughly the same 
speed whether the delay time is long or 
short. 


EXPERIMENT III 


This experiment was concerned with 
the effects of DAF on the accuracy 
of transmission of meaningful prose 
material. No previous studies have 
been published which investigate this 
problem. 
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TABLE 5 


"TRANSMISSION TIMES (IN MSEC.) FOR E1G 


IT LETTERS UNDER 


DIFFERENT DeLay Times: Exp. II E 
Delay Time (in Msec.) 3i 
Ms 00 30 60 90 120 | 150 | 180 | 210 | 230 | 270 | 300 | 00 
5 41 | 042 | 044 
44 | 046 | 043 042 | 046 045 | 040 048 042 04 * 67 
E 061 166 161 | 172 4179 | 178 171 | 165 | 179 168 154 i ae 
A 260 | 265 | 270 | 283 | 267 | 283 259 | 252 269 | 254 | 255 24s | 250 
N 244 | 247 | 243 | 257 | 253 | 260 251 | 248 | 255 | 258 280 430 | 436 
w 446 | 442 | 440 | 431 | 437 452 | 448 | 423 | 438 417 422 383 | 302 
D 379 | 386 | 393 | 396 | 400 403 | 388 | 399 | 411 | 303 378 pn 609 
J 628 | 628 | 603 | 610 | 596 622 | 640 | 592 | 613 621 | 580 pa 534 
B 499 | 528 | 530 | 549 550 | 566 | 528 | 530 544 | 546 | 519 322 337 
M | 333 | 338 | 336 | 343 341 | 351 | 341 | 331 345 | 337 | 324 | 322 | 
nis " »no-delay 
Method conditions only (errors in the no-delay 


The Ss and apparatus were the same as 
those in the previous experiments, 

Materials—Two sets of six sentences each 
Were prepared. One set consisted of sentences 
Whose word structure was relatively uncom- 
plicated (e.g., "Sometimes I Climb mountains 
in my dreams,"), the Other consisted of 
Sentences whose worq Structure was more 
complex (e.g, “crystallography and physical 
chemistry”) though it is clear that the dis- 
tinction might hardly be significant to such 
highly skilled Ss, 

Experimental design.—Each S transmitted 
both easy and hard Sentences (at the speed 
which he would employ under normal trans- 
mission conditions) under no-delay and delay 
conditions. "The order of Passages and of 
delay/no delay was randomized, The delay 
used was 180 msec, A total of 1 
was contained in the six : 
while there were 209 letter. 
cult" passages. 


90 letters 
“easy” sentences; 
s in the six "diffi- 


Results 


Table 6 shows the frequency. dis. 
tribution of occurrence (summed Over 
the six Ss) of each letter contained in 
the "easy" and “hard” sentences, 
arranged in groups, according to 
whether the letters are represented by 
one, two, three, or four symbols. The 
table also shows the absolute number 
of errors (summed over Ss) for each 
letter under delay and no-delay condi- 
tions, together with the percentage of 
errors for each letter for the delay 


conditions being negligible). pst 
Inspection of the table shows en 
relatively few errors are made as 
letters involving one or two ur 
(with the exception of Letter M S 
the “difficult” condition) under del 
with a very marked increase in ies 
number of errors for letters sepia! 
three or four symbols. There ee 
to be little difference between [m 
"easy" and “hard” conditions in t i 
respect. Under the no-delay condi- 
tions, the number of errors ne 
negligible. Once again there i a 
to be little difference in the error 
Produced by three- and four-symbo, 
letters. It is interesting to note eia 
the letters K (-.-) and R (.-.) whic : 
are symmetrical, alone produced vir 


-an 
tually no errors among the (hree- a 
Íour-symbol letters, 


Discussion 


The difference between the total mea 

er of errors for the delay and no-de a 
Conditions in both the "easy" (187 as 
against 9) and the "difficult" (212 ik- 
against 8) Sentences illustrates very ey d 
ingly the Profound disruption caused Its 
the delayed feedback. The resu nd 
closely parallel the findings of Exp. I a! 


l ces of 
using nonmeaningful sequen 
letters, 
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TABLE 6 


ERROR SCORES FOR Eacu LETTER in “Easy” anp “Dirr 


CULT” MATERIAL 


UNDER No DeLay Axp DeLay Connitions: Exp, III 


"Easy" Material "Difficult" Material 
Letter No Delay Delay No Delay Delay 

Freq. Errors Freq. Errors % Freq. Errors Freq. Errors % 
E 138 0 128 11 8 144 0 144 5 3 
T 90 0 80 0 0 138 0 138 2 1 
A 60 0 56 0 0 126 0 126 2 2 
I 72 0 70 7 10 120 0 120 7 6 
M 54 0 54 3 6 30 1 30 7 23 
N 96 0 94 1 1 84 0 84 5 6 
D 24 1 22 8 33 30 0 30 9 30 
G 42 1 40 16 38 6 0 6 2 33 
K 6 0 6 0 0 6 0 6 0 0 
0 81 1 78 26 31 78 0 78 35 45 
R 78 0 72 3 1 78 1 78 2 3 
S 90 2 85 24 27 90 1 90 29 32 
U 48 0 44 8 17 18 0 18 2 11 
Ww 12 1 11 3 25 6 0 6 2 33 
B as | o | us | 15 | 31 2| 0 ml 6 | s 
(C 18 1 16 3 17 66 2 66 16 24 
F 12 0 11 2 17 18 0 18 3 17 
H 48 1 43 18 38 66 1 66 36 55 
iv 42 0 40 | 15 36 42 0 42 11 26 
B 48 0 45 14 29 54 0 54 14 26 
Q 6 1 6 1 17 6 0 6 2 33 
V 0 0 0 0 0 6 0 6 2 33 
X 0 0 0 0 0 6 1 6 $ 50 
Y 24 0 23 9 38 24 1 24 12 50 


OTHER RESULTS 


One of the most important findings 
Only briefly mentioned so far is the 
fact that, without exception, the 
errors made did not affect the accu- 
Tate transmission of adjacent letters, 
whether they were transmitted cor- 
Tectly or not. In other words, there 
Was no carry-over of an error on one 
letter to the next. All Ss continued to 
transmit at the same rate under delay 
as they had under no delay (Table 5). 
They appeared to be quite unaware of 
the very large proportion of errors 
which were being made under delay 
9n the three- and four-symbol letters. 
Vhen questioned after completion of 
testing, they admitted having diffi- 


culty with some letters but did not 
realize the extent of the errors. The 
types of errors made are clearly 
illustrated in Fig. 1 which shows the 
transmission of the word “physical” 
under no delay and delay. Trans- 
mission of the letters "i" and “a” is 
unaffected by the errors made on the 
other letters. The time data in Table 
5 confirm this conclusion since these 
correctly transmitted letters were, of 
course, embedded in sequences of in- 
correctly transmitted letters. 
Qualitative analysis of the types of 
errors proved difficult. Inspection of 
the data indicated clearly that an 
error almost without exception in- 
volved the transmission of an addi- 
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tional symbol, rather than omission 
of one of the symbols, of a letter. 
However, although casual inspection 
seemed to reveal that, for example, an 
extra dot was added in letters be- 
ginning with a dash and ending with 
dots, closer examination revealed 
many instances where it seemed pos- 
sible that instead of adding an extra 
dot at the end of a letter, S might be 
inserting a shortened dash immedi- 
ately following the first dash, In 
terms of the time relationship between 
transmitted response and perception 
of the feedback, a demonstration of 
which of these two possibilities was in 
fact occurring, becomes crucial. How- 
ever, the interpretation of the trans- 
mission changes under delay would 


also depend in part on the character- 
istic mode of transmission 
sent letters under no delay. That is, 
in the case of the letter B (-, . +), for 
example, it is possible that some Ss 
transmit a dash followed by three dots 
of equal transmission time, whereas 
other Ss transmit a dash followed by 
three dots of unequal (or progressively 
diminishing) transmission time. De- 
tailed analysis of the data from the 
first and third experiments supported 
this interpretation but the results are 
not reported in detail in this paper. 


of correctly 


Types of errors occurring under delayed auditory feedback. 


SUMMARY AND CONCLUSIONS 


Three experiments were carried (^ A 
investigate the performance of skilled Mr acie 
code operators when the auditory fce cli 
resulting from the transmission of Sein 
was delayed. In the first experiment ae 
sequences of eight letters of the Morse ‘att 
alphabet were transmitted under a pee adë 
delay of 180 msec. while S was transmi I 
either each letter as a separate unit, or ast 
tinuously at own rate, or continuously puer 
as possible. The principal results indica 


n- 
In the second experiment, the Boi 
dom sequences of letters were used ifferent 
transmitted these sequences under 10 di teps) 
delay times (30-300 msec. by 30-msec. dei no 
in addition to two control sequences unt re to 
delay. In all conditions, instructions Wer? 


1 ct incipa 
transmit as fast as possible. The princip 
results indicated: 


n T of 

1. A discrete increase in the numbe 

errors at about 120 msec. delay. times 
2. An equal effect on errors of delay timog 


"aem 3 
Sreater than 120 msec. up to the limit of 
msec. used, 
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3. Confirmation of the findings in the first 
experiment that three- and four-symbol 
letters are much more subject to error under 
delay than one- and two-symbol letters. 

4. Failure to confirm the finding in the 
first experiment that letters beginning with a 
dash and ending with dots produce more er- 
rors than letters beginning with a dot and 
ending with dashes. 


In the third experiment the same Ss 
transmitted meaningful material (categorized 
arbitrarily as “easy” or "difficult") under no 
delay and a delay of 180 msec. The principal 
results confirmed those found in the first 
experiment with the exception of the finding 
concerning the order of dots and dashes. 
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80 human Ss divided into 2 groups of CRF and 2 groups of 25% PRF 


were used in a GSR classical conditioning paradigm to test the d 
nation hypothesis as an explanation of the PRE. Removal of th 


electrodes and instructions as to t 


in the Informed. 
in the Informed grou 
a 2-component CR 
mediated CR. 


The phenomenon of partial (inter- 
mittant) reinforcement leading to 
greater resistance to extinction than 
continuous reinforcement has become 
a firmly established empirical law in 
psychology (at least above the phy- 
letic level of the fish), A frequently 
invoked explanation of this partial- 
reinforcement effect (PRE) has been 
that provided by the di 


V scrimination 
hypothesis (Mowrer & Jones, 1945), 
This hypothesis maintains that the 


rate of extinction of a response is 
inversely related to the similarity be- 
tween acquisition and extinction con- 
ditions. "There have been many at- 
tempts to refute the hypothesis, most 
notable of which have been Weinstock 
(1954), Wilson, Weiss, and Amsel 
(1955), and most recently Theios 
(1962) and Jenkins (1962). 

The attacks on the discrimination 


1 This study was supported in part by Re- 
search Grants K3-MH-4177 and MH-06282 
from the National Institute of Mental Health. 
The authors wish to thank James Block and 
William Swayze for their technical assistance 
in the use of a computer in both data re- 
duction and analysis. A brief report of this 
experiment was presented at the Eastern 
Psychological Association, April 1964, 


The presence of a conditio 
Ps combined with the abi 
characterized as a sim 


iscrimi- 
e shock 


he onset of extinction was assumed to 
provide equalization of discriminabilit 


extinction series for 1 PRF and 1 CR 
inquiry for these Informed groups pri 
independent of resistance to extinc 
the presence of the PRE in the Noninform 


y between the acquisition and 
RF group. A postexperimental 
ovided a test of discriminability 
tion. The results demonstrated 
ed groups and its abolition 
ned GSR during extinction 
olition of the PRE suggests 
ple conditioned CR and a 


hypothesis have, for the most pars 
been so structured as to depenc 
upon Z's evaluation of the similarity 
between acquisition and extinction 
rather than the similarity as perceivec 
by S. The E's evaluation of dis- 
criminability has frequently heen 
directed to the relative abruptness © 
the transition between acquisition ane 
extinction. The discrimination hy- 
pothesis, however, need not be con- 
Strued so as to be solely dependent o 
the abruptness of the local transition. 
Bitterman, Feddersen, and T yler 
(1953) have maintained that discrim- 
inability may be a function of ia 
degree of congruence between tre 
total patterning of the acquisitio! 
and extinction series, Discrimination: 
however, does not lie either in the 
Patterning of the stimuli or in £, ne 
in S. An adequate evaluation of t E 
explanatory power of the discrimina 
tion hypothesis has not been presente 
ue to a failure to obtain a measure r 
S’s ability to discriminate betwee! 
acquisition and extinction which - 
independent of the very response 
Process to which the hypothesis p 
directed, i.e., resistance to extinction 
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The need for such an independent 
evaluation has been noted by Bitter- 
man et al. (1953). According to the 
discrimination hypothesis, therefore, 
if two groups of Ss, one partially rein- 
forced (PRF) and one continuously 
reinforced (CRF), could equally dis- 
criminate between an acquisition and 
an extinction series, the PRE should 
be abolished. In the present experi- 
ment it is assumed that informing Ss 
as to the onset of extinction equalizes 
the discriminability for the two groups. 
An independent measure of the effect 
of the extinction information on S's 
ability to discriminate between ac- 
quisition and extinction was obtained 
by means of a postexperimental 
inquiry. 


METHOD 


Subjects —The Ss were 116 male paid 
volunteer college students 5 y from 
Various schools in the New York City area. 
Three Ss were eliminated due to experimental 
error or equipment malfunction, 7 for failure 
to show a GSR to the UCS, 2 for having a 
negative GSR (an increase in skin resi ance) 
to the UCS, 4 refused to continue because the 
UCS was too strong, and 20 were disqualified 
on the ba of a postexperimental inquiry 
to he described later. A total of 80 5s 
remained. i 
Apparatus —The GSR was measured via 
two Ag/AgCl nonpolarizing sponge electrodes 
described by O'Connell and Tursky (1960) 
and evaluated by O'Connell, Tursky, and 
Orne (1960). A constant 100 pamp. de 
Current was applied to the electrodes via a 
Vheatstone bridge. Skin resistance was 
recorded on one channel of a transistorized 
cight-channel Type R Offner page 
running at a paper speed of 5 mm/sec. T js 
Sain settings varied from .5 my/em to 1 
mv/cm depending on the magnitude of. S's 
response. The UCS was a 500-msec., 23-v. 
1.53-ma, (re: 15,000 ohms) ac shock delivered 
to S's right shin bone via two 1$ X 3 in. 
electrodes placed approximately 4 in. apart 
Over electrode jelly. Ina previous experiment 
(Bridger & Mandel, 1964) this shock level was 
uniformly reported by all Ss as very painful. 
A differential conditioning procedure con: 
ing of one positive (CS+) and four negative 
=) stimuli was used. Each CS was a 500- 
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msec. flash of one of five 25-w. bulbs mounted 
horizontally on a Masonite panel with centers 
6 in. apart. With S lying supine, the panel 
was placed 7 ft. up on a wall approximately 10 
ft. from S’s head. Onset and duration of both 
CS and UCS were controlled by Hunter tim- 
ers, with the onset of the UCS occurring 
simultaneously with the offset of the CS. On- 
set and duration of both stimuli were recorded 
on separate channels of the dynograph. 
Design.—Four groups of Ss were used, 
two PRF and two CRF. At the onset of 
extinction, one PRF and one CRF group 
were informed as to the onset of extinction 
(PRFI, CRFI) as described in the procedure 
section. The two remaining groups were left 
uninformed (PRFN, CREN). A single trial 
consisted of the presentation of each of the 
five lights in a random order. The extreme 
left-hand light was always designated as the 
light to be followed by shock (CS+) with 
none of the remaining lights ever followed by 
shock (CS—). The experiment had three 
parts: (a) a 5-trial adaptation series during 
which S was never shocked; (6) a 20-trial 
acquisition series during which the two CRF 
groups were shocked at each presentation of 
the CS+ (100% reinforcement) while the two 
PRF groups were shocked on Trials 1, 4, 11, 
and 20 (25% reinforcement) ; (c) a 30-trial 
extinction series. The interval between suc- 
cessive CS presentations both within and 
between trials varied from 10 to 20 sec.; in 
a CS light was presented when S's 
ince either remained unchanged or 
was increasing for a few seconds. 
Procedure.— he Ss were randomly as- 
signed to one of the four groups. The S was 
led into a well-lit sound-deadened experi- 
mental room isolated from an adjoining 
instrument room. The S was informed that 
the purpose of the experiment was to measure 
some of his physiologicalresponses. Theright 
palm and ventral surface of S's right wrist 
were cleaned with an acetone solution. A 
2-in. square of adhesive tape with a 1-in. 
diameter hole was placed on his palm. The 
palmar surface showing through the hole in 
the tape and the ventral surface of the wrist 
were coated with Cambridge electrode jelly. 
The GSR electrodes were placed over the 
jelly and held by rubber straps. The S was 
told their function. The shock electrodes 
were attached and their purpose explained as 
S lay supine on a mattress placed on a table. 
He was then informed that the experiment 
would contain three parts: (a) a period when 
his responses to the lights alone, as they 
flashed on and off, would be measured; (b) a 
period when the lights would be paired with 
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t "a iod during which 
ples Cle nim be 
edere, The S was assured that nothing 
would occur during the experiment eec] 
which he had not previously been informed. 
The adaptation series was then presented. 
At the end of the series, E returned to S’s 
room and reiterated that there would be two 
more parts to the experiment; one part during 
which shocks would be given and a second 
part during which no shocks would be given. 
The CS4- light was then designated and S 
told that shocks "would be associated with" 
the light. The Ss in the two Noninformed 
groups were told that E would not return 
until the experiment had been completed. 
The acquisition series was then presented and, 
for the Noninformed 55, the extinction series 
followed immediately without interruption. 
For the Informed Ss, E returned at the end of 
the acquisition series, removed the shock 
electrodes, wiped S's leg clean of electrode 
jelly, and reassured S t 
would or could be given. 
the purpose of the final pai 
Was to see what effect t! 
responses to the lights Presented alone. The 
extinction series was then Presented. 
Postexperimental inquiry —To provide an 
independent measure of S's ability to dis. 
criminate between the acquisition and ex- 
tinction series, a Postexperimental inquiry 
was conducted. The Informed Ss were given 
a questionnaire containing the following 
question, "After you were told that no more 
shocks would be given and the electrodes were 
removed, how sure were you that you would 
not be shocked?” Below this statement 
appeared a horizontal 10-cm. line at one end 
of which was the Phrase, “Positive no shocks” ; 
at the other end, “Positive more Shocks," 
The S was instructed to indicate by a vertical 
line his degree of belief or disbelief, Any S 
who did not mark the absolute end of the line 
indicating "Positive no more shocks” was 
eliminated from the sample, Twenty records 
were discarded by this Procedure, A further 
interview revealed that most of these Ss felt 
they might have continued to be shocked 
through the GSR electrodes, 

Response measure.—By means of a cali- 
bration template for each sensitivity setting, 
it was possible to measure skin resistance at 
any point to the nearest 10 ohms, The final 
score assigned to a response was the change in 
conductance (AC) expressed in micromhos 
according to the formula (1/Ra — 1/Rb) 10*, 
where Rb is the resistance at the moment of 
response and Ra is the resistance at the peak 
ofaresponse. In order for a resistance change 
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to be counted as a response, the ilie eps ii 
skin resistance had to begin not less than 4 
nor more than 4 sec. after the onset of any . 


RESULTS 


Two methods were used to analyze 
resistance to extinction, (a) Anderson's 
(1963) shape-function method, and 
(b) an extinction criterion measure. : 

Shape-function method.—Anderson's 
formula for shape-function Scores 
which is designed to equate for differ- 
ences in acquisition level, follows: 


F(a) = R(») — R(n)/R(«) — R0), 
where for each individual: 


f(n) = shape-function score on Extinction 
Trial n. 
R(») = asymptotic extinction response 
level. 
zxtinction 
R(n) = observed response on Extinctior 
Trial n. 


R(1) — estimated response on Extinction 
"Trial 1. 

R(^) was presumed to equal zero; 
i.e., given a sufficient number of trials, 
it was assumed that all Ss would cease 
responding. The formula then re- 
duces to a Simple ratio score: f(#) 
= R(n/R(1). The R(1) term repre- 
sents an estimate of each S's acquisi- 
tion level before the onset of the 
extinction process, The most un- 
biased and reliable estimate of RG) 
was taken to be the mean acquisition 
level reached by each S. However, 
due to the short CS-UCS interval (500 
msec.) the mean response level to the 
CS+ for the CRF groups could not 
€ obtained, Examination of the 
acquisition Performance of the two 
ion BTOUDS revealed a high correla- 
tion (Pearson r = .81, N = 40) be- 
tween each S's mean response to the 
CS4- (MCR4-) on the 15 nonrein- 
Orced trials and the mean of his 
argest Tesponse to any one of the four 
— lights on each trial (MCR—) 
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for these same 15 nonreinforced trials. 
Assuming that identical results would 
exist for the CRF groups, could their 
MCR+ levels be obtained, the 
MCR.— score for each S in the four 
Sroups was used as the best estimate 
of R(1). Each S in these four groups 
Was assigned an f(z) score on the first 
0 extinction trials based on his own 
acquisition MCR-—. á pry ae 
formed score equates Ss in erms 
differences in ennai GSR E 
he mean f(z) score in micromhos 
each of min groups on the first 10 
extinction trials is presented in Fig. 1. 
Table 1 presents the results of trend 
analyses (Edwards, 1960) performed 
9n these data. In the Noninformed 
Sroups, the significant Groups effect, 
F (1,38) = 6.01, p < .025, demon- 
Strates the classical PRE E me 
FN group showing greater resist- 
ance o ean ation Ta the CRFN. 


Group mean responses on the first 10 extinction trials. 


In addition, using the conservative 
tests suggested by Greenhouse and 
Geisser (1959), significant Trials, 
F (1,38) = 7.45, p < .01, and Groups 
X Trials interaction effects, F (1, 38) 
= 7.84, p < .01, were found indica- 
ting that a response decrement oc- 


TABLE 1 


ANALYSIS OF VARIANCE OF THE FIRST 
10 EXTINCTION TRIALS 


Noninformed Informed 
roups Groups 
Source df 
MS F MS F 
Groups (G)| 1]27843 | 6.01* .08| .03 
Error (a) 38| 46.29 3.19 


Trials (T) 9| 43.46 | 7.45** | .77 | 2.66 
GXT 9| 45.70 | 7.84** | 1.88 | 6.48* 
Error (b) |342 5.83 .29 


*$ «.05. 
**» <.01. 
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TABLE 2 


ANALYSIS OF VARIANCE OF THE FIRST 
5 Extinction TRIALS: 
INFORMED GROUPS 


Source df | MS F 
53 
Groups (G) 1 -82 5 
Error (a) 38 1.54 
"rials (T 4 5.24 10.92** 

Piera i 4 1.63 3.39 
Error (b) 152 .48 

* p <01. 


curred and that the groups differed in 
their rates of extinction. In the 
Informed groups, the lack of a signifi- 
cant Groups effect, F (1, 38) < 1, is 
interpreted as demonstrating the abo- 
lition of the PRE. The lack of 
a significant Trials effect, F (1, 38) 
= 2.66, p > .05, as well as the sig- 
nificant interaction effect, F (1, 38) 
= 648, p < .05, may have been due 
to the reversal of the mean levels of 
the two groups over the last five trials 
as compared to the first five, T here- 
fore, the data of only the first five 
trials were subjected to an additional 
analysis and the results are presented 
in Table 2. The Groups effect re- 
mained nonsignificant, 7 (1, 38) < 1, 
A significant Trials effect was present, 
F (1, 38) = 10.92, p < .01, indicating 
an overall response decrement across 
trials. Combined with the lack of 


a significant interaction, / (1, 38) 
= 3.39, p > .05, the results indicate 
the decrement occurred equally in 
both groups across the first five trials. 

Pseudoconditioning—One of the 
most effective methods of controlling 
for pseudoconditioning is to compare 
each S's response to the CS (CR+) 
with his response to a nonreinforced 
stimulus presented within a short time 
interval (CR—). During extinction 
the pseudoconditioning response level 
may also be considered as the extinc- 
tion level of the CR+. Therefore, 
each S's largest response to any of the 
four nonreinforced lights during ex- 
tinction was deemed to be a pseudo- 
conditioning response, and / tests 
were performed on each group's 
transformed extinction MCR+ and 
MCR— scores. The results of these 
analyses are presented in Table 3. 
The results demonstrate that, for all 
groups, the MCR-+ over the first five 
trials was significantly greater than 
the MCR—. On the second five trials 
the two Noninformed groups’ MCR+ 
Scores were still significantly higher 
than their MCR— scores (PREN: 
P «.05; CRFN:p < .01). For the 
two Informed groups, however, there 
was no significant difference (p > .05) 
between the MCR+ and the MCR — 


Scores and extinction was considered 
to have occurred. 


TABLE 3 


EXTINCTION P. 


SEUDOCONDITIONIN 


G ANALYSES 


First Five Trials 


" Second Five Trials 
Toup 
Mean CR+ | Mean CR— Mean CR+ | Mean CR— t 

PRFI 2522 .116 3.87** 
CRFI "785 -194 4.157 83 E y 
PREN 3.714 347 3.77% 2.423 A 2.71* 
CREN 1.836 .304 9.86*** 962 1678 3.37** 

Note.—df = 19 

*p «.05. 

ep < 01 
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Extinction criterion—This method 
utilized a nonparametric analysis of 
the data. A conditioned response 
(CR) was defined as the GSR evoked 
by the CS on each trial which was 
greater in magnitude (AC) than the 
GSR evoked by any of the four neutral 
stimuli. Each S was then assigned a 
plus or minus on each of the 30 ex- 
tinction trials depending upon whether 
or not he met the criterion for a CR. 
Extinction was considered to have 
occurred when a CR failed to be 
elicited on three successive trials. 
Fach S was assigned a score consisting 
of the number of trials before the ex- 
tinction criterion was met, with a 
maximum score of 30. A one-tailed 
Mann-Whitney U test was used to 
analyze the results. The PRE was 
present in the Noninformed groups 
with U = 106.5, p < .01 and PREN 
> CREN. For the Informed groups 
no statistically significant difference 
could bedemonstrated with U = 183.5, 
p> .05 and CRFI > PRFEI. This 
result is interpreted as demonstrating 
the abolition of the PRE in the In- 
formed groups. The results of this 
method are in agreement with those of 
the shape-function method in that the 
PRE was present in the Noninformed 
and had been abolished in the In- 
formed groups. 


Discussion 


The present results support the hy- 
pothesis that when ease of discrimination 
between an acquisition and extinction 
Series is equalized for a PRF and CRF 
group, the PRE will no longer be present. 

Towever, the presence of the PRE in the 
Voninformed group and its absence in 
the Informed suggests that the CR has 
two separate components. These com- 
Ponents may be categorized as a simple 
Ys. a mediated CS-UCS relationship. 
The following discussion of these com- 
Ponents is based in part On articles 


by Mowrer (1938) and Razran (1961). 


In the simple CR the CS directly 
signals, or is a surrogate for, the UCS. 
In the mediated CR the CS signals an 
anticipation (e.g., fear) of the reinforce- 
ment. This anticipation occurs when 
the CS is part of a compound stimulus 
including both environmental contin- 
gency cues (e.g., shock electrodes) and 
pattern of reinforcement. A given CR 
may contain both simple and mediated 
components. In the present experiment 
the PRF and CRF groups acquired 
different configurational CRs as a func- 
tion of the patterns of reinforcement. 
The extinction instructions and removal 
of the environmental contingency cues 
eliminated the mediated CR for the 
Informed groups, leaving only the simple 
CR to be extinguished. The Nonin- 
formed groups, however, had to extin- 
guish not only the simple CR but also the 
mediated CR. Dependent as it is upon 
the stimulus compound of both contin- 
gency cues and pattern of reinforcement, 
the mediated CR was responsible for the 
classical PRE. The methodology of the 
present experiment allows the separation 
of these two components of the CR and 
demonstrates that they obey different 
laws in regard to resistance to extinction. 

In keeping with the results of most 
experiments in this area, the instructions 
facilitated the extinction of the condi- 
tioned GSR. However, contrary to the 
results obtained by Wickens, Allen, and 
Hill (1963), a significant difference was 
found between the Informed and Non- 
informed groups on the very first extinc- 
tion trial, PRF:ż (38) = 2.50, p < .05 
with PRFN > PRFI and CRF:# (38) 
= 4,10, p < .001 with CRFN > CRFI. 
Using a postexperimental inquiry, Wick- 
ens et al. (1963) reported, “Slightly less 
than one in five Ss reported that they 
were not confident that the shock would 
not be presented [p. 236].” However, 
the response protocols of these nonconfi- 
dent Ss were included in the comparison 
of the Informed and Noninformed groups. 
The present experiment revealed a clear 
difference between the Informed and 
Noninformed groups on the first trial by 
excluding the protocols of the noncon- 
fident Ss. 
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The present results are relevant to 
several other issues. Grant and Schipper 
(1952) have presented a two-factor 
theory of the PRE. These investigators 
postulated that resistance to extinction 
is directly related to the number of 
reinforcements (or response strength) 
and inversely related to the discrimina- 
bility of the acquisition from the extinc- 
tion series. The Grant and Schipper 
hypothesis would predict that when 
discrimination has been equalized, the 
CRF group would show greater resistance 
to extinction than the PRF group. The 
present results do not uphold this deduc- 
tion. However, in terms of both the 
shape-function analysis of the first 5 
extinction trials and the nonparametric 
analysis of the 30 trials, there was a 
statistically nonsignificant tendency for 
the CRFI group to show greater resist- 
ance to extinction than the PRFI group. 
Whether the failure to uphold the Grant 
and Schipper hypothesis is due to the 
strong UCS at a 500-msec. ISI producing 
a conditioning asymptote, or whether, 
in aversive conditioning, a nonreinforced 
trial might be incremental to habit 
strength, remains open for further in- 
vestigation. 

Finally, Chatterjee and Eriksen (1962) 
have maintained that cognitive expect- 
ancy is an essential correlate of heart-rate 
responses. In the present experiment the 
Informed groups demonstrated a condi- 
tioned GSR contrary to their cognitive 
expectancies. 
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or AN EVENT AND ITS SUBJECTIVE PROBABILITY 
AS A FUNCTION OF METHOD OF MEASUREMENT! 


DEAN G. PRUITT ANp ROBERT D. HOGE 
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This study was designed to evaluate the strength of the relationship 
between the value of an event and the subjective probability of that 
eventas measured by 3 distinct techniques. A positive relationship was 
found between these 2 variables under all conditions but diminished in 
strength as pressures for accuracy increased. The results were discussed 
in the light of Rotter’s criticism of earlier studies. In addition, a com- 
parison was made of the 3 methods of measurement, from which proba- 
bility ratings emerged as the most reliable technique for studying re- 


actions to objective probability displays. 


. Several studies have been published 
that show a positive relationship be- 
tween the value of an event and the 
likelihood that a person will predict 
i occurrence (Crandall, Solomon, & 
Pun, 1955; Irwin, 1953; Marks, 
iaa The results of these studies 
über pr pies been cited as evidence 
an m 1e value of an event can have 
(SP) ect on its subjective probability 
re 2. However, Rotter (1954) has 
Wap an objection to this inter- 
the ation, arguing that the value of 
iu in these studies may not 
fios influenced the SP of their occur- 
mu. ih only 'the potential of the 
be pia stating that he thinks they 
soe oer, partly because he has no 
an d to differentiate his true expect- 
cies (SP) from his wishes [p. 164." 
Onsiderable theoretical importance 
attaches to the issue of whether the 
Value and SP of an event are inter- 
related, because the dominant model 
s risky decision making, the subjec- 
mely expected utility (SEU) model, 
p a es no provision for such an inter- 
Clationship (Edwards, 1962). 


For This research was supported by the Air 

tra Ce Office of Scientific Research under Con- 

Tact No, AF 33(616)-7621 and by the Na- 

ne Science Foundation under Research 
ant No, GS.311. 


The present study had two main 
goals: 


1. To replicate the major features 
of the earlier experiments using three 
methods for measuring SP: a guessing, 
a probability rating, and a betting 
method. We reasoned that it would 
be harder to maintain Rotter's posi- 
tion that SP is not involved in this 
kind of study if results similar to 
those found in earlier experiments 
were obtained under all three methods 
of measuring SP. 

2. To test the hypothesis implied 
by Rotter that predictions concerning 
an event will be less affected by the 
value of that event the stronger the 
pressures for accuracy. Assuming 
that the rating method would involve 
greater pressure for accuracy than the 
guessing method, we predicted that 
the value of an event would have more 
effect on the measured SP? of that 
event under the latter method. In 


2 Throughout this report a distinction is 
made between SP and measured SP. The 
former is a psychological process; the latter 
an experimental observation based on one or 
another technique of measurement. This 
distinction must be made because Rotter's 
criticism amounts to the assertion that differ- 
ences in measured SP do not reflect differences 


in true SP. 
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addition, we introduced a fourth con- 
dition in which incentives for accuracy 
were coupled with the rating method. 
We predicted that the measured SP 
of an event would be less strongly 
influenced by the value of that event 
under this condition than under the 
simple rating condition. No predic- 
tions were made about the betting 
method, since we were not sure before- 
hand how much pressure for accuracy 
it would exert. 


METHOD 


Apparatus and procedure—The apparatus 
consisted of two 40-w. light bulbs mounted 
side by side behind a translucent screen. The 
points on the screen where the lights could be 
seen were labeled “A” and “B.” 

During cach trial, 24 flashes of light were 
shown, each flash coming from cither Light 
A or Light B. Both the duration of a flash 
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and the interval between flashes were ap- 
proximately 1 sec. long. The sequence in 
which the lights were flashed was controlled 
from a programing apparatus. 

The 5 sat at a distance of about 8 ft. from 
the screen. In an initial briefing, he was told 
that the sequence of lights was controlled by 
a chance device and that the probability of 
Light A (and therefore of I ight B) changed 
from trial to trial. He also led to believe 
that every sequence terminated in a twenty- 
fifth flash of light, which would be recorded 
for E's use but would be obscured from S’s 
view. 

On cach trial, after viewing the first 24 
flashes, S was asked to make a judgment 
about the twenty-fifth flash. The form of the 
judgment depended on the condition to which 
he had been assigned: 


1. In the guessing condition, S was simply 
instructed to circle one of two letters, “A” 
or “B,” depending on which light he thought 
would appear on the twenty-fifth flash. 

2. In the rating condition, S was asked to 
circle a probability level on the following 
scale: 


Probability of Light “A” appearing 


1.0 9 .8 ok -6 
“A” more likely than “B” 


3. The sa e was used in the rating 
with incent ion, but here the instruc- 
tions stressed accuracy, and .S was informed 
that a cash reward would be given to the two 
Ss whose estimates of the true probabilities 
were most "accurate." 

4. In the betting condition, the following 
statement appeared at the bottom of each 
answer sheet: “For the bet ‘If light A win 100 
points’ I am willing to pay up to — 
points." 

The Ss were told that they were playing a 
betting game, whose currency consisted of 
points rather than money. On each trial 
they had an opportunity to bid for the wager, 
“Tf A win 100 points, which means that if 
Light A comes on you get 100 me if 
Light B comes on you get nothing. Bids were 
vs - writing a number between 0 and 100 
y The Ss were urged 

amount they were 
et, on the assumption 


made b 1 
in the space provided. 
to bid the mast 

villi ay for the b the a 
E nemen determination of the 
hé =" h of the bet would be made and that they 
cat n to purchase the bet, at the amount 


would h ermination, if their bid 


of the independent det 


4 3 2 1 


"B" more likely than AT 


—— 
was equal to or greater than the independent 
determination. They were told that the bets 
would be played off after the session and that 
the two persons with the highest points would 
be given money prizes. They were urged to 
vary the amount bid in terms of the likeli- 
hood, as they saw it, that Light A would come 
on during the twenty-fifth trial. 

The judgments were recorded in a booklet 
containing 1 page for each trial. Each page 
was divided into two parts. In the top part 
was a statement about the amount to be 
credited or debited to S if Light A occurred 
on the twenty-fifth flash. There were five 
possible statements: “If light A win 50 cents,” 
“If light A win 10 cents,” “If light A win or 
lose nothing,” “If light A lose 10 cents,” “If 
light A lose 50 cents.” In the bottom part of 


the page was a place for S to record his 
response. 


In all conditions, Ss were run in groups 
that varied in size from two to five. The E 
satin the room with Ss, operating the program 
by remote control. Each session lasted ap- 
proximately 1 hr. At the beginning of a 
session, 5s were credited with $1.25 and in- 


^ 
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formed that their losses would come out of 
‘his initial payment. At the end of a session, 
Wins and losses were calculated and payoffs 
Made, 

Design and subjects—A factorial design 
Was employed, involving four variables: (a) 
t Ne four conditions described above; ( three 
asic sequences of lights: one in which Light A 
®Ppeared 33%, of the time, one in which it 
gp peared 50%, and one in which it appeared 

vo Of the time; (c) the five levels of value 


(sociated with the occurrence of Light A: 
152 306, win 10¢, win or lose nothing, lose 
PÉ, and lose 50¢; and (d) two replications of 
Ne design, 
co Eighty volunteers from an undergraduate 
ass in psychology served as Ss, 20 being 
signed at random to cach of the four condi- 
oe Each S participated in 30 trials, 
saaa Posed in the following way: the three 
“SIC sequences of lights were exhaustively 
ee Rea £ 


ene Seq i re follows, 

juences of lights were as 
BACs BAABBBBABBBABABBABBBAB- 
ads AAABBAABBABBABABAAAB- 


AABB, 67%; AABABAAABAABABAABB- 
AA, 


fot 


easured SP for the five levels of value. 


combined with the five values, and each of 
these combinations was presented twice. The 
order of trials varied from S to S on a random 
basis. 


RESULTS 


In the guessing method, tests of 
significance were performed on binary 
data, a guess of Light A being given 
a score of “1” and a guess of Light B 
a score of “0.” Group measures of SP 
were obtained in this condition by 
calculating the percentage of all 
guesses in a group that named Light 
A. In the rating conditions, the 
numbers circled by S were used as 
indexes of SP in all calculations. In 
the betting condition, SP was calcu- 
lated for each trial by dividing S's 
bid by 100, a standard technique 
(Preston & Baratta, 1948; Shuford, 
1959). 
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TABLE 1 
ANALYSES OF VARIANCE FOR THE Four CONDITIONS 
d = p 
df E P 
— Guessing Rating Rating with Betting 
; 19 1.72" 2.44% 1.11 | T 
Valjes (B) 4 Sar 2.74" 222 195. 
Linear trend 1 12.60*** 6.43** 2.59 5.57 
Sequences (C) 2 17.26*** 35.44*** 35.19*** 1114559 
AB 76 1.05 1.52* ‘85 1:01 
AXC 38 413*** T.90*** 6.81*** 6.710" 
BXC 8 "76 1:38 ‘87 2.56% 
AXBXC 152 1.02 .82 fd 1.02 
Replications 300 (0.168) (2.92) (3.57) (382.94) 
Total 599 


*p «.05. 
**5 < 01. 
* 5 c 1001. 


Relationship between value and meas- 
ured SP.—Evidence of a positive 
relationship between value and meas- 
ured SP can be seen in all four curves 
of Fig. 1. The statistical significance 
of this evidence is given by the "linear 
trend over values" terms in the anal- 
yses of variance presented in Table 1. 
This term reaches an acceptable level 
of significance in the case of the 
guessing, rating, and betting condi- 
tions and approaches significance in 
the case of the rating with incentive 
condition. 

A visual comparison of the slopes of 
the curves in Fig. 1 suggests, as pre- 
dicted, that the relationship between 
value and measured SP is more pro- 
nounced under the guessing than 
under the rating condition. In order 
to make an adequate test of this com- 
parison, it was necessary to red uce the 
data from these two conditions to the 
same scale. This was done by di- 
chotomizing the rating data, treating 

ratings between .6 and 1.0 as s em 
of Light A and ratings pw nn ens 
4 as guesses of Light B. Althoug 
MS tually represented a 
ratings of .5 actually to 

‘lure to guess, it was necessary 

failu btain a set of data that 
use them to obtal 


would be comparable to the guessing 
data. 'They were arbitrarily treated 
as guesses of Light B, on the assump- 
tion that the constant error which this 
procedure would introduce into the 
averages would have no effect on the 
slope of the curve relating the percent- 
age of guesses to value. Using a 
technique for comparing linear trends 
described by Winer (1962, p. 276), a 
comparison was made between the 
slope of the curve for guessing data 
shown in Fig. 1 and the slope of a 
comparable curve for the dichotomized 
rating data. A / was obtained of 1.85, 
which was significant beyond the 
.05 level with a one-tailed test of 
significance. 

Inspection of Fig. 1 also supports 
the prediction that the relationship 
between value and measured SP would 
be stronger under the rating than 
under the rating with incentive con- 
dition. However, with a £ of .93, this 
can only be considered a trend in the 
predicted direction. 

Individual differences in the degree 
to which measured SP is affected by 
value are reflected in the Ss X Values 
interaction, which reaches significance 
only in the rating method. In a 


EVENT VALUE AND SUBJECTIVE PROBABILITY 487 


side analysis, a correlation (r = 34, 
P = .06) was found between this 
effect and scores on the Pt scale of the 
MMPI, suggesting the existence of 
a direct relationship between psych- 
asthenia and susceptibility to value 
distortions of SP, as measured by the 
rating method. 

Sequence effects. —Average measured 
SPs for the three basic sequences of 
lights are shown in Table 2. As might 
be expected, the averages for the 
sequence containing 33% of Light 
A are close to .33. But the averages 
for the other two sequences seem 
totally unrelated to the frequency 
of Light A; the averages for the 
50% sequence are much too high, and 
for the 67% sequence much too low. 
Table 1 shows that the variance due 
to sequence differences is highly sig- 
nificant for all four methods, indicat- 
ing that sequence differences had an 
important, if not predictable, effect 
9n measured SP. " 

Table 1 also reveals a highly signifi- 
cant Ss X Sequences term for every 
method. This implies that people 
differ markedly and consistently 1n 
the techniques they use for deriving 
the SP of an event from sequential 
information. 

Comparison of the methods of meas- 
urement —Because identical expen- 
ments had been done under all four 
conditions, it was possible to compare 
the methods of measurement wit 
Tegard to the kind of information 
transmitted. An analysis of variance 
Was performed on each of the 80 Ss, 
ividing the variance between his 
Tesponses into four components: (a) 
that attributable to differences be- 
cen the three sequence of lights, 

that attributable to differences be- 
oe the five outcome values, © that 

tributable to the interaction be- 
Ween Sequences and values, and (d) 


TABLE 2 


MEAN MEASURED SP FOR THE THREE 
DirrERENT SEQUENCES OF LIGHT 


Sequence 


33% A | 50% A | 67% A 


Guessing .29 3 38 
Rating 32 12 ERI 
Rating with incentive | .28 .69 .46 
Betting ES .65 51 


that attributable to differences be- 
tween the two replications of the 
experimental conditions. The four 
variance numbers (SSs) for each .S 
were then converted into proportions 
of the total variance for that S. This 
produced a matrix of 320 proportions, 
composed of four columns (corre- 
sponding to the components of vari- 
ance) and four groups of 20 rows 
(corresponding to the four conditions 
and the 20 Ss within each condition), 
with the proportions in every row 
summing to 1. Averages, computed 
across the 20 Ss in each group, pro- 
duced the data shown in Table 3. 
The methods clearly have different 
profiles. We can distinguish two 
groups: one consisting of the rating 
and rating with incentive methods, 
the other consisting of the guessing 
and betting methods. These groups 
differ primarily in the proportion of 
variance associated with sequences: 
differences between the sequences of 
lights having more effect on behavior 
for the first group of methods than for 
the second. In addition, the second 
group had a larger proportion of vari- 
ance associated with replications and 
a somewhat larger proportion with 
values. It was possible to test the 
significance of the difference between 
methods by performing a second anal- 
ysis of variance on the 320 proportions 
described just above. The interaction 
between methods of measurement and 


488 


DEAN G. PRUITT AND ROBERT D. HOGE 


TABLE 3 


AVERAGE PROPORTIONS OF VARIANCE ATTRIBUTABLE TO EACH 
or Four COMPONENTS OF VARIATION 


Components of Variation 


Cond. 
Values Sequences Val. X Seq. Replications 
i .0918 .6012 .1007 .2058 
on with inc. .0638 .5913 .0960 .2489 
Guessing .1142 .3496 .1860 .3502 
Betting -1037 -3706 1725 3528 


components of variation was highly 
significant, F (9,228) = 7.38, p < .001. 


DiscussioN 


The results provide new evidence 
about the relationship between the value 
of an event and predictions concerning 
that event but, unfortunately, do not 
enable us, as had been hoped, to accept 
or reject the hypothesis that value affects 
true SP. On the one hand, the emergence 
of a positive relationship between value 
and measured SP under three rather 
different methods of measurement might 
be thought to strengthen the contention 
that value affects true SP. On the other 
hand, the diminution of this relationship 
with increased pressure for accuracy 
seems to provide support for Rotter’s 
position. Actually, the implications of 
this latter evidence are themselves quite 
unclear. We might conclude, with 
Rotter, that pressures for accuracy affect 
the process through which SP is trans- 
lated into behavior, causing people to 
filter out other forces such as values and 
pay closer attention to their SPs. Or 
we might equally logically conclude from 
these findings that, under pressures for 
accuracy, people are stricter about the 
sources of their SP. Further research is 
needed to clarify this ambiguity. seid dt 

For all four conditions, a marked dis- 

crepancy was found beseni pine 
tion of times Light A SAS expe 


% a s C he meas- 
50 d 6797 sequences and t a 
red SP of Light A. These discrepancies 
u 


may have been due to the negative 
recency effect or '"'gambler's fallacy,” 
which makes people more likely to 
predict one kind of event the longer the 
run of the other kind of event that has 
just preceded (Jarvik, 1951). The 50% 
sequence ended in two flashes of Light B, 
which would lead to the perceived over- 
estimation of the probability of Light A 
if the negative recency effect were in 
operation. The 67% sequence ended in 
two flashes of Light A, which may help 
account for the underestimation of SP 
at the end of that sequence. The 33% 
sequence ended in a single flash of Light 
A, which would not be expected to 
produce much distortion. 

In comparing the methods of measure- 
ment, we found that people were more 
influenced by the sequences of lights in 
the rating methods than in the guessing 
or betting methods. The sequences of 
lights are, of course, the only evidence 
which is objectively relevant to proba- 
bility—the only information on which it 
is reasonable to base SP. We might, 
therefore, conclude that people are more 
influenced by objective evidence when 
making probability ratings than when 
guessing or betting. They may use this 
evidence wrongly and commit the gam- 
bler’s fallacy, but they are still more 
likely to use it. If this is true, it may 
make methodological sense to use rating 
rather than guessing or betting methods 
for studying reactions to objective prob- 
ability displays. 
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EFFECT OF PAIRING DIRECTIONALITY AND 
ANTICIPATORY CUE IN PAIRED-ASSOCIATE 
LEARNING! 

JAMES F. Voss 
University of Pittsburgh 


Each of 4 groups was presented a list of 8 paired-associate trigrams for 20 
trials. A 2 X 2 factorial design was employed in which stimulus-cue 
variation and pairing direction were varied. The 4 experimental con- 


ditions thus were: (a) A, A-B; (b) A, 


A-B; B, A-B; (c) A, A-B; A, B-A; 


(d) A, A-B; B, A-B; A, B-A; B, B-A. It was found that cue variation 
yielded significantly fewer correct Tesponses. Further analyses of 
response learning, associative learning, correct response /opportunity 
ratio, and errors suggested that this decrement was due to the necessity 


of learning both items of cach pair as responses, 
variation had a smaller, yet significant effect which 
trusions. No interactions were found. 


The paired-associate ( PA) anticipa- 
tion method consists of presentation 
of a cue item (A) followed by the pair- 
ing of the cue item and another item 
(A-B)? Experimental results (Feld- 
man & Underwood, 1957; Jantz & 
Underwood, 1958; Murdock, 1956, 
1958; Richardson, 1960) have demon- 
strated that not only A-B associative 
strength develops in PA anticipation 
learning, but backward associative 
strength, i.e., B-A, also develops. 

The issue of bidirectional associative 
strength has been pursued further by 
Asch and Ebenholtz (1962), Under- 
wood and Keppel (1963), and Schilde 
(1963). Asch and Ebenholtz, using 
the anticipation and recall methods of 
PA learning, interpreted their results 


1This research was supported by the 
National Institute of Mental Health (MH- 
08291-01) and Child Health and Human 
Development (HD-00957-01). The author 
fishes to thank Patricia Polesiak for assist- 
ae in the gathering and analyzing of the 
s ically, forward learning has been 

h JPER and backward learning R-S. 
E s à the A-B nomenclature is employed 
DOW port in order to avoid confusing 
* iae such as “The R item is the 
stater 


stimulus -~ — 


Pairing direction 
was related to in- 


as indicating that associations are 
of equal bidirectional associative 
strength. Underwood and Keppel 
(1963) compared acquisition of a 
standard anticipation PA list to an 
experimental list in which the cue item 
and related pairing occurred as A, 
A-B on one-half of the trials and B, 
B-A on the other one-half of the trials. 
The results were interpreted as in- 
dicating that “bidirectional learning 
is a simple summation of time to learn 
forward and backward associations to 
the same level of strength [p. 474]." 
Using anticipation and recall pro- 
cedures and a design highly similar to 
Underwood and Keppel, Schilde found 
significantly more errors in the bi- 
directional condition. 

Consideration of the Underwood 
and Keppel paradigm indicates that 
the experimental list involved con- 
comitant variation of the item that 
was used as the"cue (A or B) with the 
direction of the pairing (A-B or B-A, 
respectively). The purpose of the 
present experiment was to determine 
the independent and joint effects of 


the cue item and direction of pairing 
variables. 
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TABLE 1 
» PERFORMANCE MEASURES OF THE EXPERIMENTAL CONDITIONS 
Column? 
Cond. 

1 | 2 | 3 | 4 | 5 | 6 7 8 9 10 
52.7 8.5 — 1.9 64.2 — 40.2 «57 19.3 2.7 
18.9 8.6 = 1.5 65.1 — 38.8 al 31.4 9.5 
39.7 8.6 15.9 2.5 64.6 | 92.1 34.3 43 17.9 3.5 
25.4 10.1 16.4 3.3 | 60.5 75.0 | 24.7 36 20.1 7.6 


^ The columns denote the following measures: 1, mean correct respon 2, mean trials to the first response; 
ranean trials to the first occurrence of the second response of each p; associative learning trials; 5, 
cent correct responses on the first response occurrence; 6, percent corre onses on the first occurrence of the 


Second response item of each pair; 7, percent correct responses on Trial » + 1; 8, mean correct response/opportunity 


ratio; : ded: T uti i 
atio; 9, percent intralist intrusions; 10, percent extralist intrusions. 


METHOD 


Procedure—Each S was presented eight 
Paired associates by the anticipation method 
for 20 trials. A different random order of the 
EM associates was used on each trial. A 
th "SCC. presentation rate was employed for 
Te cue and pairing presentations with a 4-sec. 
intertrial interval. Standard paired-associate 
anticipation instructions were used. 

, Anticipatory cue item (C) and direction- 
pity of pairing (D) were varied in a 2 X 2 
ctorial design. Condition CiDi involved 
Presentation of the same cue item and the 
came direction of pairing (A, A-B). Condition 

1D; was identical to Cond. CiD: except that 
n * pairings of each paired associate Were 
versed on one-half of the trials (A, A-B; 
and B A different random order of the ; 
assoc} -A pairings was employed for tici 
to ear Condition CsDi was oper 
assi Ond. C.D, except that Item A of each 
half of d was employed as the cue on cie- 
One ; the trials and Item B on the ot " 

we half of the trials (A, A-B; B, A-B). 5 
as erent random order was employed for eac 

Sociation with respect to A and B trials. 
exe Mdition C3D» was identical to Cond. Cibi 
sa ane that the cue item was varied on the 
na trials as Cond. CsD: and the pairing 

Fection was varied on the same trials as 
ond, C,D, (A, A-B; B, A-B; A, B-A; B, 
~A). The random order of the cue and 
i trials of each association was re- 
weed so that the cue and pairing direction 

ations were orthogonal. Thus, in Cond. 
dien each combination of cue and pairing 
Or e, ion was presented for 5 of the 20 trials 

ach association. diia 
laze trials. The A-B pairs were 41-5390 
Scaleq association value trigrams which were 
by Battig (1959) such that A-B associa- 


Pairing 


tive strength is of approximately the same 
magnitude as B-A associative strength. The 
trigrams were pronounced by Ss with articula- 
tion occurring only as anticipations. 

Subjects—The Ss were 60 introductory 
psychology students at the University of 
Pittsburgh and they were assigned to each 
of the four conditions (V = 15/condition) by 
use of a table of random numbers. 


RESULTS AND DISCUSSION 


Total correct responses —The first 
column of Table 1 presents the mean 
correct responses for the experimental 
conditions. An analysis of variance 
performed on the correct response fre- 
quency data revealed a significant C 
source of variation, F (1, 56) = 16.59, 
p < .01, and a significant D source of 
variation, F (1,56) = 4.10, p < .05. 
The C XD interaction source of 
variation is not significant. Thus, the 
findings indicate that use of only one 
item of each association as cue yielded 
considerably better performance than 
use of both items as cues and undirec- 
tional pairing presentation yielded 
better performance than bidirectional 
pairing presentation. 

Response learning.—Response learn- 
ing was measured by tabulating the 
trials to S’s first response occurrence 
of each association, regardless of 
whether the response was correct or 
incorrect (Postman, 1962). Before pre- 
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senting the data, however, a method- 
ological issue must be considered. 

When A and B were employed as 
cues, response learning was required 
for both items. Two tabulations 
therefore were performed for Cond. 
CD, and CD. The first was trials 
to the occurrence of the first response, 
regardless of whether it was A or B; 
the second was trials to the first occur- 
rence of both items as responses, A 
and B. 

Column 2 of Table 1 presents mean 
trials to the first response for the four 
experimental conditions. Any re- 
sponse not given was designated Trial 
21. An analysis of variance performed 
upon the trials to the first response 
data revealed that the C, D, and 
C X D sources of variation are not 
significant. 

Column 3 of Table 1 presents mean 
trials to the first occurrence of A and 
B as responses. Any response not 
given again was designated Trial 21. 
It should be noted that the mean 
trials from first to second response 
learning was 7.3 for Cond. C.D, and 
6.3 for Cond. CDs. An analysis of 
variance performed upon the C.D, 
and CD» second response data com- 
bined with the C;D; and C,D; data 
revealed a significant C source of 
variation, F (1, 56) = 45.05, p < .01. 
'The D and C X D sources of variation 
are not significant. It should be 
pointed out that since either A or B 
was presented as cue between the 
occurrence of the first and second re- 
sponses in Cond. CD; and CD», the 
opportunity for the second item to 
occur as a response was a function of 
the trials on which it was a response 
rather than a cue. | The data indicate 
that of the 7.3 trials in Cond. CD; 

een the first- and second-item 
DE er learning, a mean of 3.8 trials 
K 3» which the response was 
page response item of the 6.3 
es in Cond. C2D2 between the first- 
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and second-item response learning, a 
mean of 3.5 trials occurred in which 
the response was the second. response 
item. 

In summary, response learning does 
not vary significantly as a function of 
cue variation when the first response 
in Cond. CD; and C.D» is considered. 
However, response learning takes 
significantly more trials in Cond. C4D; 
and CD; than C,D, and CDs when 
both items of each pair are considered. 
In addition, pairing directionality 
does not significantly influence re- 
sponse learning. 

Associative  learming.—Associative 
learning was measured by tabulating 
the following difference: trials to the 
first correct response minus trials to 
the first occurrence of that particular 
response (response learning) for each 
pair (Underwood & Schulz, 1960). 
Column 4 of Table 1 presents the 
mean associative learning trials for 
each experimental condition. The 
means of Cond. CD; and CD, refer 
to the first item of each pair correctly 
associated. (This item was almost 
without exception the first item 
learned as a response.) 

The mean associative learning trials 
for each .S was computed and the data 
were submitted to an analysis of vari- 
ance. The C source of variation is 
significant, F (1, 56) = 8.40, p < .01. 
The D and C X D sources of variation 
are not significant. 

Since the opportunity for correct 
association was less in Cond. CD: 
and CD; than in Cond. C,D, and 
CiD: because of the cue variation, the 
trials for each association were tabu- 
lated on which the item given previ- 
ously as a response could be given as 
a correct association. The data in- 
dicate that the mean associative 
learning trials, taking into account 
opportunity for a correct association, 
were 1.3 in Cond. CD, and 1.3 in 
Cond. CiDs. These data were con- 
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sidered with the C;D; and CiD: data 
and submitted to an analysis of vari- 
ance, The C, D, and C X*D sources 
of variation are not significant. These 
results indicate that the significant 
effect of cue variation upon associative 
learning is attributable to the lack of 
Opportunity that cue variation pro- 
vides for a correct association to be 
made. 

Associative learning trials 
tabulated for the second item of cach 
Pair in Cond, CaD, and CDe. The 


were 


Mean associative learning trials were 
0.3 in Cond. CD, and 1.2 in Cond. 
2D 


C 


» These data indicate that 
Sociative learning of the second re- 
SPonse item was more rapid than that 
Of the first item. An analysis of vari- 
ance was not performed on these data 
"ecause of the restriction produced by 
the relatively large frequency of re- 
Sponses that were never associated 
correctly (47 of 120 in Cond. C2D1 and 
98 of 120 in Cond. CoD). 
Associative learning and its relation 

response learning were studied 
further by consideration of the follow- 
‘Ng question: what percentage of first 
Fesponse occurrences was correct re- 
Sponses? Column 5 of Table 1 
Presents the percent correct responses 
Siven on the first occurrence of the 
response, The percent of Cond. CsDi 
ied CDs represents the first response 
‘tem learned. The percent correct 
responses on the first response Occur- 
trie was computed for each S, sub- 

ed to an arcsin transformation, 
nud the transformed data were sub- 
Tied to an analysis of variance. 
variat qn > and C des “a of 
` are not significant. 

‘olumn 6 of Table 1 presents the 
i correct responses that were 
itis im the first poen Je a 
and C response item in Con ? ani 
Mitte "ü "an The data were no ÁE 
Cans, LO an analysis ol variance 

Se of the relatively large number of 


to 
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Ss that gave all correct responses on 
the first occurrence of that particular 
response (7 of 15 in Cond. CoD, and 
6 of 15 in Cond. CD»). 

In summary, the associative learn- 
ing data indicate: (a) associative 
learning of the first item in Cond. 
CD; and CoD» is poorer than CyD, 
and CD», but this difference is due 
primarily to the lack of opportunity 
in the former conditions to associate 
the first response item correctly; 
(b) associative learning of the second 
response item in Cond. CD; and CoD: 
was quite rapid; (c) pairing direction 
yielded no significant effect upon as- 
sociative learning. 

Post-correct-response irial.—The 
trial on which a response is first given 
correctly is denoted Trial 7 and the 
subsequent trial, z + 1. Column 7 of 
Table 1 presents the percent correct 
responses on Trial a + 1 with the first 
response item considered in Cond. 
CoD, and CDs. The percent correct 
on Trial n + 1 was tabulated for each 
S, submitted to an arcsin transforma- 
tion, and the transformed data were 
submitted to an analysis of variance. 
Despite the relatively large group 
differences, the C, D, and CX D 
sources of variation are not significant. 
It should be noted that this method of 
tabulation weights each S equally, but 
the number of z + 1 trials within a 
given Sis not differentially considered. 
© The percent correct responses on 
Trial n + 1 in Cond. CD; and C:Ds 
was tabulated when the same cue item 
was presented on Trials n and n + 1 
and when the Trial 2 + 1 cue item was 
different from the Trial # cue item. 
In Cond. C2D,, the percent correct 
responses for the same cue item was 
43.2 and for the second cue item was 
26.6; in Cond. CD, the percent 
correct responses for the same cue 
item was 35.7 and for the second cue 
item was 18.9. The percent correct 
on Trial n + 1 for each S under the 
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same and second cue conditions was 
submitted to an arcsin transformation 
and a / test was performed on the 
transformed data. In Cond. C;D;, no 
significant difference was found be- 
tween percent correct on z + 1 of the 
same cue and second cue occurrence, 
t (26) = 1.71, p> .05.2 However, 
significantly greater percent correct 
responses occurred in Cond. C.D. 
when the same cue item occurred on 
Trials n and n 4-1, £ (26) = 1.84, 
p < .05. 

The z + 1 data thus indicate that: 
(a) use of both items of a pair as cues 
yielded a tendency toward fewer cor- 
rect responses on Trial n + 1, espe- 
cially when the cue item changed over 
Trials 2 and n + 1; (5) pairing direc- 
tion had no significant effect. 

Correct opportunity ratio.—The ratio 
of correct responses following the first 
correct response + opportunity for 
correct responses after the first correct 
response was tabulated for each pair. 
The mean ratio for each experimental 
condition is presented in Column 8 of 
Table 1. The mean ratio for each § 
was submitted to an arcsin trans- 
formation and the transformed data 
were submitted to an analysis of 
variance. The C source of variation is 
significant, F (1, 56) = 11.92, p < .01. 
The D and C X D sources of variation 
are not significant. These results in- 
dicate that cue variation yielded a 
significantly detrimental effect upon 
correct responses when the oppor- 
tunity for correct responses is taken 
into account. . . 

In order to isolate this detrimenta] 
effect, the correct/opportunity ratio 
was tabulated for each of the two 
response items in Cond. C.D, and 
CoD». In Cond. C2D,, the mean C/O 
ratios were .54 and .63 for the first and 


3 The data of one S in each group are not 

sidered in the analysis because the same 
P occurred on the z and x + 1 trials. 
cue 
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second response items, respectively; 
in Cond. C4Ds, the mean C/O ratios 
were .49 and .65 for the first and 
second response items, respectively. 
These ratios indicate that first, the 
C/O ratio for the four experimental 
conditions is quite comparable when 
the first response item is considered in 
Cond. C.D, and CD. Second, the 
C/O ratio for the second response item 
in Cond. C;D; and C.D, is greater 
than that of the first item. These 
findings lead to the conclusion that 
the significant decrement in C/O ratio 
performance due to C variation is not 
due to either a poorer ratio of correct 
responses of the first or second re- 
sponse item, but to the lack of correct 
responses between the occurrence of 
the first correct response and the 
occurrence of the second correct re- 
sponse. Furthermore, since the first 
item C/O ratio is comparable to the 
C/O ratio in Cond, C,D, and CiDs 
the decrement in C/O ratio must be 
attributed to the lack of correct 
responses for the second response item 
prior to the first correct anticipation 
of the second response item. This 
finding is quite related to the response- 
learning data since the slow learning 
of the Second response item neces- 
sarily means that it cannot be asso- 
ciated correctly and therefore correct 
response frequency must be lower in 
the cue variation conditions. 
Errors.—Columns 9 and 10 of Table 
Present the mean intralist and 
extralist intrusions, respectively. An 
analysis of variance performed upon 
the intralist intrusion data revealed a 
Significant C Source of variation, 
F (1, 56) — 4.06, b < .05, and a sig- 
nificant D source of variation, F (1,56) 
= 3:05, ð: « 05. The C X D source 
of variation is not significant. An 
analysis of variance performed upon 
the extralist intrusion data revealed a 
Significant D Source of variation, 
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F (1,56) = 12.01, p< .01. The C 
and C X D sources of variation are 
not significant. 

The intralist and extralist intrusion 
data were divided into errors prior to 
and after the first correct response. 
Analyses of variance revealed a signifi- 
cant D source of variation for intralist 
intrusions prior to the first correct re- 
sponse, P (1, 56) = 9.3, p < -01, 
extralist intrusions prior to the first 
Correct response, F (1, 56) = 10.4, 
P < .01, and extralist intrusions after 
the first correct response, F (1, 56) 
= 6.26, p < .01. No other sources of 
Variation are significant. 

The intralist and extralist intrusion 
results therefore indicate: (a) pairing 
direction variation yielded signifi- 
Cantly poorer performance attribut- 
able to intralist and extralist intru- 
Sions; (b) cue variation had an effect 
of borderline significance. 


_Conclusion.—The following conclusions 
With respect to cue variation appear 
Warranted from the above data: (a) use 
a both items of a paired associate as 
anticipatory cues yields a decrement in 
Correct responses; (b) this decrement 15 
directly related to the difficulty of re- 
Sponse learning of the second item of the 
ven (c) associative learning of the first 
sponse item of the pair is retarded, but 

MS loss is negated when opportunity for 
Pairing is considered; (4) associative 
Sarning of the second response item is 
Quite rapid. In addition, it may be con- 
b aded that pairing direction had a small, 
a Significantly detrimental effect upon 
re] “ect response performance that was 
un to intralist and extralist intru- 
"rend This finding suggests that pairing 
fter on variation produces difficulty in 

m discriminability- 
co he present results are not directly 

Mparable to the findings of Underwood 

Keppel (1963) and Schilde (1963) 

cause of differences in experimental 

a However, the present findings 
cue o Strate the independent effects of 

and pairing variations. In particu- 


lar, the cue variation results show the 
importance of response learning in paired- 
associate experiments where either item 
of a pair is employed as the cue (Asch & 
Ebenholtz, 1962). In addition, the find- 
ing of rapid, although not immediate, 
associative learning of the second re- 
sponse item is in general agreement with 
the bidirectional association position. 
On the other hand, the finding of poorer 
correct response and error performance 
with bidirectional pairing presentation 
suggests that unidirectional and bi- 
directional pairing are not equivalent, 
and therefore the associations are not 
learned bidirectionally. 
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850 unsophisticated Os cach made multiple e 
fractional estimates of lightness on the same 
white standard called 1 or 100, respectively, 
median estimates were approximately lincar against reflectance. 
multiple estimates were lincar against the 
a reciprocal relationship is predicted by transducer the 
depended upon the order of the variables: the ratio of th 
smallest slope ranged between 2.1 and 1.2 to | (p 
Conclusion; unsophisticated Os can estima 
tematically, but the nature and size of the un 
by present and prior experimental conditions, 


It was reported in a previous paper 
(Poulton & Simmonds, 1963) that the 
very first magnitude estimates of 
reflectance are influenced by the range 
of numbers available to O. The pres- 
ent experiment investigated this effect 
in greater detail for variables judged 
against a fixed standard. 

By requiring different groups of 
unsophisticated Os to judge the vari- 
ables in different fixed. Orders, it was 
also possible to examine the effects 
of prior context more thoroughly than 
has been done previously. Nothing 
appears to be known about the effect 


1 Carried out at the Applied Psychology 
Research Unit, Cambridge, England. The 
eo h was supported by the British Medical 
censi? Council. by the United Sate Air 
ame Cities of Scientific Research of the 
TT arch and Development Command 
vs os European Office, and by the 
A Talence Foundation. We are also 
e ilie authorities and student volun- 
age ne Cambridge colleges in which the 
one vica were conducted, and to P. Frec- 
experiments ii 
an Tor statistical advice. 


stimates of darkness or 
gray variables against a 

On a normal plot the 
The 
fractional estimates, whereas 
ory. The slope 
€ greatest to the 
< 01 or better). 
ate sensory magnitudes sys- 
its they use are determined 


of multiple estimates upon each other. 
For fractional estimates it has been 
found that an estimate can be changed 
reliably (p < 001) by a prior frac- 
tional estimate (data from Warren & 
Poulton, 1960). However it is pos- 
sible that such effects are transient, 
and cancel out when a series of vari- 
ables is used, The results reported 
here Suggest that this js not the case. 


METHOD 


Observers.—These were mainly male under- 
graduates of Cambridge University, although 
a few graduates and women students were 
used. Psychology students and those who 
had served in previous experiments were ex- 
À sample of 175 of their ages ranged 
from 18 to 28 with a median of 21. 

Display.—A Piece of white paper 17.5 
X 11.3 in. with a reflectance of 83% served as 
a background. It lay ona table directly in 
front of O, arranged symmetrically with a 
long side closest to him. For the groups of 
Os making four fractional judgments of light- 
ness it was labeled 100 in pencil in the left- 
hand corner; for all multiple judgments of 


496 


CONTEXT AND ME 


darkness it was labeled 1. In the middle of 
the white paper was placed a gray stimulus 
Patch 2.0 X 1.5 in. with its long side parallel 
to the long side of the background. The 
Percentage reflectance of the patch was 68, 58, 
34, 19, 7.8, or 3.4. The white paper back- 
ground and small patches were coated with 
nonglossy paint. Their reflectances were 
measured both with a Macbeth illuminometer 
and with an illuminometer made by Salford 
Electrical Instruments Limited. As previ- 
ously (Poulton & Simmonds, 1963), the light- 
Ing on the table was at the ambient level to 
which O had become adapted. 

. Experimental conditions.—Vach O judged 
in turn two, three, or four grays. The reflect- 
ances and the orders of presentation of the 
groups making four judgments are shown in 
able 1, ‘The same reflectances and orders 
re used for both the multiple and fractional 
estimates; thus the multiple estimates of 
aroup A correspond to the fractional esti- 
Mates of Group E, and so on. ach O made 
either multiple or fractional estimates, never 
both, 

Procedure.— The experiment was con- 
ducted in Cambridge Colleges. One E 
canvassed for volunte and kept curious 
Eyes off the display. When O sat down at the 
table, the second Æ handed him a white card 

5 in. with instructions typed on it, à 
Pencil, and a white answer slip 4 X 3in. For 
Tactiona] judgments of lightness the in- 
Structions read: “You will sec a gray patch of 
Paper lying on a white blotter. If the light- 
Ress of the white blotter is called 100, what 
Smaller) number describes the lightness of 
the gray patch? Ple write down your 
answer on the slip of paper provided.” The 
structions were explained if necessary by 
Paraphrasing them. For multiple judgments 
of darkness “white” was changed io light 
M" "lightness" to “darkness,” í we A 
to «4 20d "smaller" to “larger. a ei T DS 
whe 8t gray” was made to ipic ohi 
bs Were unwilling to accept à num 
an Zero for “white.”) +e first 
d Soon as O had written pes DM 
slg pnl estimate, the p a Bie his 
Secunda. Patch and skec E M ‘The third and 
answer below his first. e 
Ourth tten on the reverse 
Side of of the Os who 
two judgments were ARE 
is Finally O was requested gine boar 
answers with anyone who had not already 
Ne the x This procedure was 
Tepe volunteers until a 
Brou s had been 


answers were wri 


F experiment. 
ated with successive 
P of 50 acceptable answer 
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obtained. Answers which did not satisfy the 
instructions were discarded. 

Calculations.— Two distinct median meas- 
ures were calculated. The group median 
plotted in the figures was the median of all 
the Os in the group. They generally num- 
bered 50, but in Fig. 2b some of the medians 
were computed from the pooled scores of two, 
or in one case three, groups of 50 Os. The 
mean median of Table 1 was calculated from 
the medians of the five subgroups comprising 
the first and last five Os in the group, the 
second and one-from-last five Os, and so on. 
Its 56% fiducial limits are an index of the lack 
of precision in the median which is tolerated 
when using experimental groups of only 10 
Os. The straight lines in the figures were 
fitted by the method of least squares. This 
method was also followed in calculating the 
quadratic terms. — Two-tailed have 
always been used. 


tests 


RESULTS 


The group medians, mean medians, 
and their 56; fiducial limits for the 
groups of 50 Os which made four 
multiple estimates of darkness are 
shown in the top half of Table 1. 
Figure la shows the group medians 
plotted against reflectance on a normal 
plot, while Fig. 1b is the corresponding 
log-log plot. The straight lines in 
Fig. la were fitted by the method of 
least squares. Only the quadratic 
term of Group B was reliable, and 
here reliability reached only the .05 
level of significance. No attempt was 
made to fit the points in Fig. 1b by 
straight lines. The functions for the 
reciprocals of the medians (not shown) 
were also markedly curvilinear on a 
log-log plot. . 

The slopes of the lines in Fig. 1a, 
together with their 5% fiducial limits, 
are given in the one from last column 
of Table 1. The slope of each group 
appears to have been determined by 
the reflectance of the very first vari- 
able. The slope of Group A which 
judged first the variable of 68% 
reflectance, was reliably smaller than 
the slopes of Groups B and C which 
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Fig. 1. Median numerical estimates of groups of 50 Os. (Figures la through 1d plotted 
against the reflectance of the variable; Fig. 1e multiple estimates plotted against the fractional 
estimates of different Os judging the same variables in the same order. Except for Fig. 1b, a 
Straight line passing through the standard has been fitted to the data of each group or pair of 


8roups by the method of least squares.) 


The corresponding data for frac- 
tional estimates are given in the 
reliably smaller than the slope of bottom half of Table 1 and in Fig. 1c 
Toup D which judged the variable andd. In both the normal and the 
gt 7.8% reflectance first. The ratio of log-log coordinates the fitted straight 


the Slopes of Groups D and A was lines shown left quadratic terms which 
Over 1.8 to1 were unreliably statistically. Here 


Judged first the variable of 44% re- 
Sctance, And these in turn were 
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Fic. 2. Median numerical estimates of the 
very first judgments of groups of at least 50 
Os plotted against the reflectance of the 
variable (Fig. 2a); and multiple estimates 
plotted against fractional estimates (Fig. 2b). 
(The straight lines passing through the stand- 
ard have been fitted by the method of least 
squares.) 


the slopes, which are given in the last 
two columns of Table l, appear to 
depend upon the reflectance of the 
first variable which was not near the 
middle of the range of reflectances, 
In both sets of coordinates the slope 
of Group E which was presented first 
with the variable of 68% reflectance, 
was reliably greater than the slope of 
Group H which judged the variable 
of 7.8% reflectance first. Of the two 
groups which first judged the variable 
of 44°7 reflectance, Group F Which 
followed this by the variable of 68% 
reflectance had a reliably greater TE 
than Group G which judged Ru 
the variable of 7.8 néfieelniod, p S 
-atio of the slopes of Groups E and G 
en over 1.2 to 1 on the normal plot, 
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and over 1.6 to 1 on the log-log plot. 
In both cases it was in the opposite 
direction to the ratio of the slopes of 
Groups A and D. 

Figure 1e shows the median mul- 
tiple estimates plotted against the 
median fractional estimates of the 
groups which judged the same vari- 
ables in the same order. For each 
pair of groups the straight line fitted 
to the data left a quadratic term which 
was not reliable (5 > .05). The slope 
of the combined Groups D and H, 
—.30 = .02, was reliably the largest, 
and the slope of Groups A and E, 
—.14 + .01, was reliably the smallest 
(b < .02 or better in each case). The 
ratio of these two slopes was over 2.1 
to 1. When the reciprocals of the 
median multiple estimates were plot- 
ted against the fractional estimates 
and vice versa, the functions „(not 
shown) were all markedly curvilinear 
with both normal and log-log co- 
ordinates. 7 

Figure 2 shows the corresponding 
data for the very first judgments of 
groups of at least 50 Os. The straight 
line in Fig. 2a fitted to the median 
multiple estimates had a slope of 
—.20 + .01. The quadratic term was 
just reliable (b < .05), indicating a 
slight concavity upwards as for Group 
B. On a log-log plot the function 
was markedly curvilinear, resembling 
those in Fig. 1b. 

The straight line fitted to the 
median fractional estimates had a 
slope of 1.08 + .03 on the normal plot 
of Fig. 2a, and a slope of .76 + .05 
on a log-log plot (not shown). In both 
cases the quadratic term was not 
reliable (b> .05). Figure 2b shows 
the linear relationship between the 
Very first multiple and the very first 
fractional estimates. The fitted line 
had a slope of —.27 4.02; the 
quadratic term was not reliable. 
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Discussion 


Linear relationships on a normal plot.— 
Figures la, Ic, and 2a show that the 
estimates of both darkness and lightness 
were approximately linear when plotted 
against reflectance on a xormal plot. The 
results of fractional estimates of lightness 
are distantly related to the data of 
Stevens and Galanter (1957, Fig. 12) and 
of Torgerson (1960). For the slopes are 
all fairly close to unity; thus points lying 
close to one of the straight lines also lie 
close toa straight line on the correspond- 
ing log-log plot of Fig. 1d. The fitted 
exponents of Fig. 1d, which range from 
61 x .04 to .98 + .05, are probably relic 
P5 smaller than the value reported by 
Stevens and Galanter and by Torgerson 
of 1.2, although these authors give no 
Confidence limits. But the higher value 
mn perhaps be attributed to the multiple 
fr imates which their Os made as well as 
oe since prior multiple estimates 
a be expected to increase the size of 
(x exponent for fractional estimates 
(Poulton & Simmonds, 1963). 
N Eun. the results for multiple 
to tetrad of darkness cannot be related 
seas à oo results, since they are by no 
fecri ARR when plotted against re- 
peice on m lepleg plt (Eig. 1b) 
whia the reciprocals of the medians, 
1961a on ee theory (Steve 
tional p 27) should correspond to frac- 
straigl estimates of lightness, lie on à 
Pines line when plotted against re- 
ance on cither a normal or a log-log 
plot, 
in "ur of prior context. — The differences 
Slope in Fig. 1a and c suggest that 
Prior context can have considerably more 
Seneral and more pronounced effects on 
Magnitude estimates than has so far 
Cen realized (Stevens, 1956; Stevens & 
not aon, 1956). The effects shown were 
too PS simply to selecting a first variable 
of y ar to one end or other of the range 
Star ee For Groups F and G both 
ine with a variable of 44% reflect- 
range Which is near the middle of the 
di =, yet their slopes were reliably 
log Peg on both the normal and the log- 
Plot (Table 1). Nor were the effects 
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due simply to the particular selection of 
variables, for the reflectances covered 
most of theavailable range. Also Groups 
A and B actually had an identical set of 
variables to judge, as did Groups C and 
D, yet in each case the fitted slopes were 
reliably different. Figures la and c show 
that in general the fits of the straight 
lines were good. Thus the groups were 
reasonably consistent in the differences 
which they had acquired. 

Table 1 shows that for the multiple 
estimates of darkness the ratio of the 
largest slope to the smallest was over 1.8 
to 1. For the fractional estimates of 
lightness the ratio was over 1.2 to 1 on 
the linear plot, and over 1.6 to 1 on the 
log-log plot. Differences in slope of this 
order of size cast more doubt upon the 
representativeness of the slopes for 
different sensory continua obtained by 
magnitude estimation (Stevens 1960a, 
1960b, 1961a, 1961b) especially when 
the order in which the variables were 
judged is not stated. 
ect of number scale.—On a simple 
transducer theory (Stevens, 1961a, p. 27) 
the multiple estimates of darkness should 
have borne a reciprocal relationship to 
the fractional estimates of lightness, since 
the stimuli were identical. Figures leand 
2b show that instead the relationship was 
Apparently if Æ presents un- 
d Os with a range of reflect- 
they will 


lincar. 
sophisticate 
ances and a number scale, 
arrange the reflectances more or less 
linearly along the number scale whatever 
form the number scale takes. This 
ocation of numbers to reflectances is 
systematic (see Fig. 1a, 1c, and 2a), but 
it cannot be said to be "magnitude 
estimation" when as here one number 
scale is related reciprocally to the other. 
Clearly Os must be able to appreciate 
sensory magnitudes in order to be sys- 
tematic. But it would appear that a 
“magnitude estimation” experiment does 
not necessarily indicate the nature of Os’ 
units of judgment, since they are a 
function of the number scale Æ tells 
them to use. 

That this has not been reported previ- 
ously must be because no E has ever 
required unsophisticated Os to make a 


all 


502 POULTON, SIMMONDS, WARREN, AND WEBSTER 


number of multiple estimates without 
also making or having made fractional 
estimates. And a multiple estimate fol- 
lowing a fractional estimate can be 
reliably different from a multiple esti- 
mate made as the very first judgment of 
magnitude (Poulton & Simmonds, 1963). 
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LOUDNESS, A PRODUCT OF VOLUME TIMES DENSITY ! 


S. S. STEVENS, MIGUELINA GUIRAO, AND A. WAYNE SLAWSON 


Laboratory of Psychophysics, Harvard University 


Experiments were designed to determine the relation between loudness 
and 2 other auditory attributes, volume (apparent size) and density 
(apparent compactness or concentration). 2 sets of stimuli, quarter- 
octave bands of noise covering a wide range of center frequencies and 
SPL, were presented through earphones to Os who made magnitude 
estimations of one or another of the attributes. The loudness estima- 


tions were plotted against loudne: 


ss level and found to agree with the 


sone scale. A plot of the estimations of loudness against the product 


of the estimations of volume times 
a slope of 1.0 in log-log coordinates. 
to volume times density. This rel 


involving magnitude estimati 


the estimations of density produced 


Loudness is therefore proportional 


ation was confirmed by experiments 
ons of the inverse attributes, softness, 


smallness, and diffuseness. These inverse attributes were found to be 


the reciprocals of their respective 


ness turned out to be proportional 
diffuseness. Thus magnitude estim 


their inverse I 
product of volume times density. 


One of the earliest criticisms to 
ret the experimental demonstration 
(Stevens, 1934) that Os can judge the 

ensity of tones was the conjecture 
that tonal volume (apparent size or 
argeness) and tonal density (com- 
ee or concentration) are merely 
he inverse of each other and not two 
lifferent attributes. It was noted 
that when the frequency is raised 
Volume decreases and density in- 
oe which suggests a reciprocal or 

Verse relation. But both volume 
“nd density increase with stimulus 
intensity, which seems to deny the 
‘verse relation. Clearly the problem 
^ More complex than was implied by 
Ome of the early discussion. 

a he successful scaling of the two 
Uditory attributes, volume (Terrace 

Stevens, 1962) and density (Guirao 
X Stevens, 1964), reopened the ques- 
‘on of the interrelations among the 


leppe . 
This research was supported in part by 


direct attributes. As predicted, soft- 
to the product of smallness times 
ations of both the attributes and 
s established the proportionality between loudness and the 


attributes. Those studies showed 
that both volume and density grow as 
power functions of sound pressure. 
The point of interest in the present 
context is the behavior of the expon- 
ents of the power functions. The ex- 
ponents of the volume functions grew 
larger with frequency and the expon- 
ents of the density functions grew 
smaller. There was, in fact, a sug- 
gestion that at cach frequency the two 
exponents may add approximately to 
a constant sum, a sum not greatly 
different from 0.6, the exponent of the 


loudness function. 
A decision was therefore made to 


explore more directly the relations 
among volume, density, and loudness. 
What appeared to be needed was a 
series of experiments in which the 
stimuli would be narrow bands of noise 
and the same set of stimuli would be 
used throughout a given series of ex- 
periments. Bands of noise were 
chosen over pure tones mainly because 
Os seem able to judge noises with 
greater certainty than tones. Pure 
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TABLE 


ss (GM) AND QUARTILE DEVIATIONS (QD IN DECILOGS) OF THE MAGNITUDE 


GEOMETRIC ME? 
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1 


ESTIMATIONS OF THREE ATTRIBUTES OF NARROW BANDS OF NOISE 
Noise Bands Loudness Volume Density 
Freq. SPL GM oD GM oD GM oD 
Series A 
200 55 1:5 1.5 26.6 3.5 L3 1.5 
325 86 31.3 0.8 51.2 0.7 11.4 2.3 
450 68 14.1 1.5 19.9 1.5 5.4 1.5 
653 51 3.6 1.1 12.3 1.9 2.5 1.5 
653 100 76.5 1.4 59.8 2.0 33.7 1.0 
1,000 75 20.9 1.5 13.2 2.0 17.7 0.6 
1,630 89 39.9 1.8 244 3.0 36.3 2.7 
2,000 45 3.5 1.8 4.7 1.1 6.4 2.0 
4,000 95 77.7 1.8 15.2 4.0 87.6 2.0 
6,000 63 5.8 2.0 3.8 11 28.4 24 
Series B 
200 50 1.0 1.5 18.3 2.0 0.8 2.2 
325 80 23.3 2.0 27.7 1.6 9.2 2.2 
720 100 79.4 1.1 38.0 2.7 51.0 1.1 
720 42 1.9 0.9 8.1 2.0 2.1 0.3 
1,000 70 12.9 1d 11.7 0.3 13.1 0.4 
2,580 90 64.2 1.5 12.5 3.3 60.4 0.8 
3,000 50 6.3 m 46 0.9 8.8 1.8 
6,000 70 12.6 1. 2.9 2.0 217 1.9 
Series C g 
200 50 
325 80 39 SA He Zi 
mm *SHBOSYIB 
T4 are) ^ * E 
1,000 70 99 [> OS 136 i 1 
2,580 90 182 15 61.6 11 
3,000 50 32 11 16.1 2.0 
6,000 70 82 3.0 40.8 0.9 


Note.— The center frequency and sound pressure level are 
65 db. SPL was presented at the beginning of each run and cal 


Series C. 


tones have an indefiniteness about 
them that makes even their loudness 
less easy to judge than the loudness 
of a band of noise (Stevens, 1956). 
In addition to the scaling of loud- 
ness, volume, and density by magni- 
tude estimation, the inverse of each 
of the attributes was scaled. If the 
same functional relation among the 
attributes can be established by meas- 
uring the reciprocals as well as the 
attributes themselves, the nature of 


given for each band. A standard at 910 cps and 
lled 10." No loudness judgments were made in 


the relation becomes more securely 
established. 


METHOD 


The procedure was essentially the same as 
that followed in the earlier experiments on 
density (Guirao & Stevens, 1964). Bands of 
noise were generated by filtering a white noise 
with the aid of a continuously variable, con- 
stant K, filter network (Allison 2B). Fora 
given stimulus the high- and low-pass sections 
of the filter were set to the same frequency, 
which produced a band about a quarter of an 
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Magnitude estimation of 
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yp ALL is 
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Loudness level in phons 


"RSS d. Relation of loudnes 
estimatic two different experime 
tinal ions, 2 by each of 10 Os. 

us frequency and SPL [T4 


tours. 7p 
5. The line P 
scale.) line through the dat 


The abscis 


Octa 
fe Widle ata level 3 db. down from i 
PPR.g he stimuli were delivered through 
earphones mounted in sponge neo- 
cushidns MX-41/AR. The calibration 
them py carphones in a 6-cc coupler showed 
Charact, em have a reasonably flat pospone 
inis e ee over the range of frequen ; 
Phones * The voltage across the two, n 
a bs Connected in series Was cad with a 

/ od Ntine root-mean-square voltmeter, 
| 320, and converted into sound pressure 

- on the basis that 1 v- corresponds 


en Os chosen more or less at random [rom 


amo, 
"E 30 students and staff of the laboratory 


Se 
Pree each experiment. The stimuli were 
differ. ed twice each in irregular order, à 
Centered order for each O. A band of noise 
Presen at 910 cps and 65 db. SPL was 
ang ted at the outset of each experiment 


assi E : 
alloy ened the value 10. [he O was 
h to hear the standard again whenever 


s reque n ^ - E 
Sted it. Two sets of stimuli varying 


estimations to loudness level. 

The ordinate of each point is the geometric mean of 20 
sa of each point was determined by converting the 
ible 17 into loudne 
a shows the slope 


(The squares and circles 


sss level with the aid of equal-loudness con- 
[exponent] predicted by the standard sone 


widely in both frequency and SPL were used. 
The center frequencies and the SPL of the 
bands are tabulated in Table 1. 

The instructions were as follows, with 
appropriate variations for the different attri- 
butes—loudness, volume, ordensity—and their 
ss, smallness, and diffuseness 


inverses—softne 
I am going to present a series of noises. 
Your task is to judge the loudness of each 
noise. The loudness of the first noise will 
be called 10. Assign to each of the succeed- 
ing noises a number proportional to its 
apparent loudness, remembering that the 
loudness of the first noise was called 10. 
For example, if the second noise sounds four 
times as loud, call it 40; if half as loud, call 
it 5, and so forth. 


Further definitions were also included in 
the instructions for most of the attributes. 
Volume was said to refer to the size of the 
sound, i.e., how large or small it appears to be. 
Density was said to refer to the compactness, 
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concentration, or hardness of the sound. 
Diffuseness was said to refer to how tenuous, 
thin, or rarefied the sound appears to be 


RESULTS 
Loudness 

A first concern is whether Os can 
make satisfactory judgments of loud- 
ness when the stimuli vary in fre- 
quency as well as in sound pressure. 
Two separate experiments were run, 
each with a different set of stimuli. 
The geometric means of the magni- 
tude estimations are shown in Table 1, 
Series A and B. In order to compare 
these data with the standard loudness 
function, each SPL was converted to 
its corresponding loudness level by 
means of the equal-loudness contours 
presented by Stevens and Davis 
(1938). These contours are based on 
the equal-loudness data of Fletcher 
and Munson combined with the curve 
for minimum audible pressure deter- 
mined by Sivian and White. Al- 
though the contours are for pure tones, 
they are presumably applicable to 
narrow bands of noise whose width is 
of the order of a critical band (Zwicker, 
Flottorp, & Stevens, 1957). 

Figure 1 shows the loudness esti- 
mates plotted against loudness level 
in phons. The line through the data 
has the slope (exponent) of the stand- 
ard "sone" scale. Although a few of 
the points deviate fairly far from the 
line, the overall picture provides an 
interesting confirmation of the loud- 
ness function, for Os were required to 
judge loudness in the face of wide 
variations in other aspects of the 
stimuli. Part of the variability must, 
of course, be attributed to imperfec- 
tions in the equal-loudness contours 
that were used to determine loudness 
level. Some of the variability must 
also be due to the inevitable uncer- 
tainties regarding the true sound Dres- 
sure levels in the ears of the listeners 


(Beranek, 1949, p. 738). 
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The data agree with the sone scale 
of loudness somewhat more closely if 
the equal-loudness contours presented 
by Stevens and Davis are altered so as 
to preserve their upward concavity 
throughout the entire intensity range. 
The agreement then is quite good, and 
it suggests that the cqual-loudness 
contours could be mapped out directly 
by magnitude estimation if a suffi- 
ciently thorough experiment were run. 


Density and Volume 


Three experiments were run to scale 
cach of the attributes, volume and 
density. The geometric means of the 
magnitude estimations are listed in 
Table 1. In order to verify that the 
data for density were similar to those 
obtained in the earlier study (Guirao 
& Stevens, 1964) with bands of noise, 
the magnitude estimations were plot- 
ted against SPL with frequency as a 
parameter. The technique employed 
was that used in the earlier study and 
the outcome was essentially the same. 
Density was found to grow as a power 
function of sound pressure. The ex- 
ponent decreases as frequency in- 
creases. Only at the frequency 250 
cps was there a suggestion of a slightly 
different exponent from that found in 
the earlier study. Consequently, this 
test of the adequacy of the data ap- 
peared to be reasonably well satisfied. 

Precisely the same test could not be 
applied to the volume judgments be- 
cause the volume functions have not 
been determined for bands of noise. 
It was nevertheless possible to com- 
pare the data for volume in Table 1 
with the volume functions for pure 
tones presented by Terrace and 
Stevens (1962). There is generally 
good agreement, but with occasional 
poor agreement at the highest fre- 
quency tested. It was typically the 
frequencies in the region above 3,000 
cps that caused the most difficulty in 
all aspects of these experiments. 
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P ud each stimulus the geometric 
Dalen of the magnitude estimation of 
ume was multiplied by the corre- 
SPonding estimation of density and 
Hotted on the abscissa in Fig. 2. On 
Me ordinate in Fig. 2 are plotted the 
qpe tude estimations of loudness. 
he both coordinates are psycho- 
Ysical attributes. I 
auditory density is proportional 
anal udness divided by volume, as the 
su Ogy with physical density might 
oo the data in Fig. 2 should 
ara a slope of 1.0 in the log-log 
ine inates, That is the slope of the 
sume tally drawn. The fit appears 
lag tly good to warrant the con- 
Sion that loudness is proportional 


to lo 


100 
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to volume times density. The most 
deviant point corresponds to one of 
the high-frequency stimuli, 6,000 cps. 


Inverse Functions 


It has been demonstrated on at 
least seven perceptual dimensions that 
Os can make magnitude judgments of 
the inverse of a continuum (Stevens & 
Guirao, 1963; Torgerson, 1960). The 
Os produce thereby a function that 
approximates the reciprocal of the 
power function obtained when the 
continuum itself is judged. This pro- 
cedure of inverse judgments promised 
a means by which the validity of the 
relation shown in Fig. 2 could be 
checked by an independent approach. 
The strategy was to scale each of the 
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TABLE 2 


GEOMETRIC MEANS (GM) AND QUARTILE Deviations (QD 1N DECILOGS) OF THE MAGNITUDE 
3EOME ? 3 ~ 


ESTIMATES OF THREE INVERSE ATTRIBUTES OF NARROW BANDS OF NOISE 


Noise Bands Softness Smallness Diffuseness 
Nois | 3 i E. AE NE. 
Freq. | SPL | GM | QD GM op GM op 
0 100 1.1 1.5 2.0 L5 4.6 2.9 
2 580 90 1.2 20 | 40 20 21 3.0 
"325 80 53 | 12 | 42 1.6 16.5 1.5 
1,000 70 70 | 08 | 99 0.5 9.8 0.9 
6,000 70 8.9 09 | 31.0 20 28 2.0 
3,000 50 16.7 | 13 | 285 0.8 84 2.0 
720 42 420 | 13 109 | 30 360 | 1.9 
200 50 630 | 17 | 43 | 3.4 59.0 | 17 
i | 


Note.—The center frequency and sound pressure lev 


65 db. SPL was presented at the beginning of each run and called "10," 


inverse continua, softness, smallness, 
and diffuseness, and to test the pro- 
portionality between softness and the 
product of smallness and diffuseness. 

The stimuli and the geometric 
means of the magnitude estimations 
are listed in Table 2. 

In order to examine the degree to 
which the inverse judgments approxi- 
mate the reciprocal of the direct judg- 
ments, the two sets of geometric 
means for each continuum were 
plotted in Fig. 3. The lines with 
slopes of —1 indicate the predicted 
slope of the inverse functions on the 
assumption that they obey the recip- 
rocal law. 

The magnitude estimations of soft- 
ness are almost the reciprocal of the 
estimations of loudness. There is a 
slight tendency for the function, soft- 
ness vs. loudness, in Fig. 3 to be 
concave downward, a feature that has 
characterized many experiments on 
inverse scaling. The reason for this 
curvature is not understood. 

Evidence of a similar slight curva- 
ture is apparent in the plot of diffuse- 
ness vs. density, although the recip- 
rocal relation is closely approximated. 

Volume (largeness and smallness) 
fared less well on the whole than did 
1 ributes. The estimates 
the other attri : loss to th 
of smallness fall fairly close to the 


are given for cach band. A standard at 910 cps and 


reciprocal line, but there is a general 
constriction of the overall range of the 
judgments. The fact of the matter is 
that Os reported more trouble with 
volume, both largeness and smallness, 
than they reported with loudness or 
density. This fact does not neces- 
sarily mean that volume is the most 
difficult attribute to judge. It may be 
relevant that most of the Os had 
served in experiments on density 
during the months prior to the present 
series of experiments. It is conceiv- 
able that a practiced familiarity in the 
abstracting of the density attribute 
made it difficult to switch to volume. 

In any case, the goodness of the fit 
to the reciprocal functions in Fig. 3 
suggests that the inverse judgments 
should be adequate to test the relation 
between softness and the product of 
smallness times diffuseness. Accord- 
ingly, the geometric means of the in- 
verse judgments were plotted in Fig. 
4. The line in Fig. 4 has the slope 1.0 
predicted by the hypothesis. The 
point that departs farthest from the 
line represents the stimulus whose 
Írequency, 200 cps, was the lowest 
used and the one that produced the 
largest variability in the judgments of 
smallness. The other points in Fig. 4 
are satisfactorily close to the predicted 
line. 
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100 mn T T 
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Product of smallness times diffuseness 


Fic. 4. Approximate proportionality between softness and the product of smallness times 


diffuseness. 


(This relation is predicted by the function in Fig. 2. 


Exact proportionality is 


indicated by the straight line with a slope of 1.0.) 


for the geometric rather than the 
arithmetic mean in the averaging of 
magnitude estimations. 

The medians of all the quartile 
deviations show that the variability 
tended to be smaller for judgments of 
loudness (1.5 decilogs) and density 
(1.5 decilogs) than for judgments of 
volume (2.0 decilogs). f . 

The median of the quartile devia- 
tions for the judgments of softness 
(1.3 decilogs) was smaller than that 
for the judgments of smallness (1.8 
decilogs) and diffuseness (1.9 decilogs). 
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EFFECT OF STIMULUS VARIABLES O 
REACTION TIMES AND THRESHOLDS! 


J. KASWAN asp S. YOUNG 


University of California, Los Angeles 


For 8 different groups of Ss, individual patterns from sets of either 
4-, 3-, or 2-alternative patterns were shown to each S under 2 conditions 
in a forced-choice recognition task. In one condition, exposure 
duration was varied by E and thresholds were determined for each 
pattern. In the other condition, choice r on times (CRT) were 
obtained, measured from stimulus onset to S's key press. It was 
predicted and found that the order of CRTs would parallel the order of 
thresholds in each combination of patterr This finding supports the 
assumption that the time required to receive spatial information suffi- 


CHOICE 


cient to distinguish patterns is an identifiable component of CRT. 


A major purpose of this study is to 
examine the role of the 
duration threshold in the total time 
Fequired for the identification of a 
Visual pattern, as indicated by a 
choice reaction time (CRT). 

Previous work has shown that the 
Probability of correctly identifying a 
Pattern consisting of a single horizon- 
tal array of variably spaced elements 
IS a function of increasing exposure 
Curation, The exposure duration re- 
{ured for a given level of correct 
response to a particular pattern de- 
bends on (a) the size and configuration 
Of differences between spaces within 
a pattern and (b) the spacing se- 
(uences which characterize the alter- 
native patterns (see Kaswan, Young, 
* Nakamura, 1965). 

i In the present study, it is assumed 
that exposure-duration thresholds re- 
€ct the time required to obtain 
Sufficient spacing information about a 
Pattern to distinguish it from other 
Patterns, The assumption that stim- 
"lus patterns must be exposed for 


exposure- 


a. This study was supported by a grant from 
Te P Buibsot Foundation (G-9589) 
(Src Public Health fellowship for S. Young 
I "I-MH-19,664-02). We wish to thank 
hg homer for helping to collect the data, and 
“nk Gale Kratz for helping to analyze it. 


some small but specific amount of time 
before they can be accurately dis- 
criminated suggests that this time 
should be reflected in measures other 
than exposure-duration | thresholds. 
We expect that in a task in which a 
stimulus is exposed throughout the 
choice reaction time, measured from 
stimulus onset to S's response, the 
function of the initial part of the 
exposure is largely to convey spacing 
information about the pattern. The 
initial component of the CRT thus 
presumably includes a time correlated 
with the threshold. If the differences 
in CRT for patterns within a set of 
alternatives are due largely to differ- 
ences in exposure duration required to 
identify these patterns, then differ- 
ences in CRT should parallel differ- 
ences in threshold. 


METHOD 


Materials —Four stimuli were used in the 
study. Each stimulus consisted of a single 
linear horizontal array of }-in. vertical lines 
printed on blank off-white cardboard. The 
Ev pattern array consisted of 26 lines spaced 
evenly with 6 mm. between lines. The U 
stimulus contained 24 lines spaced with ran- 
domly located 6-, 8-, 7-, and 5-mm. spaces. 
The 3 stimulus contained 24 lines spaced in 
the sequence 5, 5, 7, 5, 5 mm. to appear as 
groups of three lines. The 1,2 stimulus con- 
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sisted of 24 lines spaced in the sequence 6, 4, 
8, 6, 4 mm. (see Kaswan et al., 1965, for 
samples of the Ev, 3, and 1,2 stimuli). 

Subjects. —A total of 146 men and women 
college freshmen and sophomores was ran- 
domly divided into eight groups. A four- 
alternative group of 30 Ss was presented with 
the four stimuli described above. A three- 
alternative group of 20 Ss received the Ev, 3, 
and 1,2 patterns. There were six two-alter- 
native groups of 16 or 17 Ss. Each two-alter- 
native group received a different combination 
of two of the four patterns. 

Procedure.—Each S had two sessions, a 
tachistoscopic session for threshold determina- 
tion and a CRT session, spaced 1 wk. apart. 
The order of sessions was counterbalanced 
within groups so that about half the Ss in each 
group received each order. 

At the beginning of the first session, .$ was 
requested to describe in his own words cach 
stimulus of the set on which he was to be 
tested. If his description contained the label 
intended by E, or an appropriate synonym, 
E selectively specified the label for the stim- 
ulus as "even," “three,” “uneven,” or “one- 
two." The S was instructed to use these 
labels in the experiment. A short practice 
run appropriate to the session was given in 
both tachistoscopic and CRT sessions. The 
S was rejected if he could not make an un- 
ambiguous description within an initial 30- 
sec. presentation of each stimulus appropriate 
to his group. Three Ss were rejected for this 
reason. All stimulus presentations were 
preceded by a foreperiod of 1.0 sec. initiated 
by a .5-sec. ready signal (buzzer). The S 
fixated on a blank field prior to the 
exposure. 

To determine thresholds, each pattern of 
the appropriate set was presented twice at 
each of 17 exposure durations ranging from 
.01 to 1.00 sec. The durations used were .01, 
.02, .03, .04, .06, .08, .10, .15, .20, 25, .30, 
.40, .50, .60, .70, .80, .90, 1.0 sec., with the 
1.0-, .01-, and .03-sec. exposures omitted from 
the two-, three-, and four-alternative condi- 
tions, respectively. The patterns were 
presented by the method of constant stimuli 
with pattern and exposure-duration sequence 

mized. 

mr CRT session, cach pattern was 
presented tachistoscopically 34 times on a 
random schedule. The S was instructed E 
depress a single microswitch key as soon a e 
could assign a standard label to the es 
presentation. Following the key press w ich 
terminated the stimulus exposure, 5 was re- 

ired to vocally report the label assigned to 
quire 


the exposure. Choice reaction times, meas- 
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ured from onset of the stimulus exposure to 
its termination by S, were recorded to the 
nearest .01 sec. A single key was utilized 
because of the possibility that the usual 
arrangement of separate keys for each re- 
sponse alternative introduced additional dis- 
criminative and perceptual-motor coordina- 
tion factors. . 

A pparatus.—All stimuli were presented in 
a two-field Gerbrands mirror tachistoscope. 
Buzzer signal and foreperiod were controlled 
with Hunter timers. Choice reaction times 
were measured with a Hunter klockkounter. 


RESULTS AND DISCUSSION 


A division of the 34 CRTs made by 
each S to each pattern into sequential 
thirds (after elimination of the longest 
CRT) indicated that CRTs became 
faster from the first third to the last 
third, and that the last 11 CRTs ap- 
peared consistently more stable than 
the earlier ones. All CRT results 
reported here were therefore based on 
a mean of the last 11 CRTs given by 
each S to each of the patterns to which 
he was required to respond. 

The proportion of error responses 
to all stimuli in the CRT condition 
was very small. The mean propor- 
tions for two-, three-, and four-alter- 
native conditions were, respectively, 
-01, .01, and .02. The range over the 
three conditions was .00—.08. 

Table 1 shows the median of mean 
CRTs and median thresholds to each 
pattern in each combination (group). 
Analyses of variance indicated that in 
the two-alternative condition, differ- 
ences in threshold between patterns 
within combinations exceeded the 
-05 significance level for the Ev-U, 
F (1, 15) = 8.72; the U-1,2, F (1, 16) 
— 4.78; and the 3-1,2 combinations, 
F (1, 15) = 495. In separate com- 
parisons between CRTs within com- 
binations in this condition, only the 
U-Ev combination reached signifi- 
cance, F (1, 15) = 16.32, p < .01. 
Analyses of variance of threshold 
differences between patterns in the 
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TABLE 1 
MEDIAN THRESHOLDS AND CHOICE Times (CRT) ror EACH PATTERN 
WITHIN EACH ALTERNATIVE COMBINATION (SET) 
Stimuli within Two-Alternative Sets 
Ev U | Ev 3 | Ev 12|U 3 U 12| 3 12] X 

M. CRT (msec.) 464 502 |576 606 | 360 367 | 552 510 |532 504|463 491 | 434 
Mdn. threshold (msec.)| 10 60| 10 25| 10 10} 30 15] 60 35] 10 25| 25 


Three-Alternative Set 


ios J 
Ev | 1-2 | 3 | X 
Mdn. CRT (msec.) |576 | 614 | 643 |611 
Mdn. threshold (msec.) | 45| 80] 150} 92 


Four-Alternative Set 


Ev| 3 [12] Uu | € 
685 | 856 | 910 |1024| 834 
10 | 200 | 250 | 300| 190 


three- and four-alternative conditions 
all exceeded the .01 level of confidence, 
F (2, 38) = 6.00 and F (3, 84) = 9.13, 
respectively, Pattern CRTs differed 
Significantly in the four-alternative, 
i (3, 84) = 26.7, p < .001, but not in 
the three-alternative condition. 
, The findings of major interest are 
d parallel between threshold and 
T. Table 1 shows that within each 
ut the eight groups, the rank order of 
thresholds and CRTs was the same. 
latistical tests (analyses of variance 
Or the two-alternative condition and 
an order test developed by Page, 1963, 
Fun the other two conditions) evaluat- 
‘8 the probability of obtaining such 
Parallel orderings by chance in each 
“Ombination, all led to the rejection of 
© null hypothesis beyond .001 level 
Confidence, 


"ss Obtained parallel between the 
up ern order for thresholds and i 
f De the assumption that at re ca 
with, T CRT of stimuli 
imn on depends on 
quip e Ces in the amount of time re- 
Or tl to receive sufficient information 
tha PE recognition of each pattern, 1-6» 
old “me which largely determines thresh- 
his parallel may also be inter- 
indicating that the time 
by the threshold is not an 


ifference in the 
each combinati 


artifact of the tachistoscopic method but 
reflects a general determinant of pattern 
perception. 

Since thresholds are assumed to be a 
rough measure of the time required to 
obtain information about differences be- 
tween spaces necessary for accurate 
percepts, it seems reasonable to suppose 
that most of the processing of such 
spacing information should begin only 
toward the end of the threshold period, 
since information which has not been 
received cannot be processed. Assuming 
constant processing time for stimuli 
within a combination, this formulation 
therefore predicts that CRT differences 
between alternative patterns should cor- 
respond to threshold differences. This 
expectation is closely approximated in 
several combinations shown in Table 1; 
E-U and 1,2-U in the two-alternative 
condition; E-1,2 in the three-alternative ; 
and E-3, E-1,2, and 3-1,2 in the four- 
alternative conditions. 

The above distinction between the tem- 
poral aspects of information reception 
and processing is entirely analytic, fol- 
lowing logically from a formulation of the 
spatio-temporal determinants of informa- 
tion input (see also Kaswan et al., 1965). 
While this distinction implies some sort of 
neurophysiological information-assembly 
mechanism which combines information 
over time prior to processing, specula- 
tions about the characteristics of such a 
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system are beyond the scope of this 
Paper. fh ar <a gr 
Apart from the relatively brief, initial 
time component, the role of stimulus 
determinants of CRT is not clear. Thus, 
beyond the stimulus rank-order simi- 
larity for threshold and CRT within each 
stimulus combination, variations in CRT 
were, in many respects, different from 
threshold variations. For example, while 
threshold varied little for the Ev pattern 
in the different combinations, CRT for 
this pattern varied considerably. Simi- 
larly, Table 1 shows that rank order, 
from lowest to highest, of the two- 
alternative CRT means does not parallel 
that obtained for the two-alternative 
threshold means. For example, the long- 
est CRT was for the 3 pattern in the 
Ev-3 combination, but the highest 
threshold was in response to the U pat- 
tern in the EV-U and 1,2-U combina- 
tions. Median CRT for patterns within 
each combination thus does not appear to 
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be determined entirely by the difficulty of 
discriminating the design involved when 
threshold is the criterion for the ease of 
discrimination. This discussion indicates 
that unlike relative CRT to patterns 
within cach of the eight combinations 
used, variations in CRT to cach pattern 
in different combinations (e.g. CRT to 
the 1,2 pattern in the 1,2-Ev as compared 
to the 1,2-3 combination) may not be 
directly related to the time required to 
receive stimulus information. 
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RESISTANCE TO EXTINCT 


ION FOLLOWING BLOCKING 


OF THE INSTRUMENTAL RESPONSE 
DURING ACQUISITION ! 


W. EDWARD BACON 


McGill 


+ groups of 11 rats each were tra 
alleyway and were then e 
respectively, 10 and 20 cont 
Groups P and B received 20 tri 
schedule. 
nonrewarded training tri 
ures indicated that the P and B gr 
to extinction than the C; and 

activities suggest that the blocki 
variable alters start-box behavior. 


It is established that a response is 
SE resistant to extinction following 
oe reinforcement than follow- 

§ continuous reinforcement training 

ewis, 1960). There is also evidence, 
though inconclusive, that temporar- 

Y blocking the response at the goal 
20X on some acquisition trials (partial 
End of reinforcement) increases re- 
TI ànce to extinction (Renner, 1964). 
ww literature, however, offers no 
Mie a concerning the effect that 
the i id blocking the response in 
ide eyway on some acquisition por 
lacie. extinction, although gage 
Tesis Ag during extinction decreases 
istance to extinction (Klugh, 1961; 
ambert & Solomon, 1952). Theaim 
aide present study was to provide 
| information. Following Bindra's 
63a) suggestion that "irrelevant 
Avior occurring in the start box 
Y be crucially related to extinction 
HET, various spontaneous activities 
i e 
Sil 
tion, 
for Re y 
Course a and support 
dement the experiment. Gr 
assi is extended to L. 


Sta; 3 
data nce in collecting and anal 
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eff. 


ported by a re- 
) from the Na- 
dealth to D. 


S research was sup 
grant (MII-03238-04 
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throughout the 


-ateful acknowl- 
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inuous 
als daily on a random 50% reinforcement 
Group B Ss were prevente! 
als by occlusion of the start box. 


University 


ined to run for water in a right-turn 


xtinguished. Groups C; and Cs received, 


sly reinforced training trials per day. 
d from running to the goal box on 
‘Time meas- 
oups were significantly more resistant 
Ce groups. Measures of irrelevant 
ng but not the partial-reinforcement 


of the animal in the start box were 
also recorded. 


METHOD 


Subjects 

The Ss were 44 male hooded rats, 71-88 
and weighing approximately 210 
f the experiment. They were 


days of ag 
at the start 0! 


gm. They: 
obtained from the Quebec Breeding Farm, 
Incorporated, St. Eustache, Province of 


ere housed in standard colony 


Quebec, and w 
e, throughout the study. 


cages, six per cag 


Apparatus 
The apparatus was à right-turn, L-shaped 
black wooden alleyway, 9.5 in. high, covered 
with hinged Plexiglas sections. The main 
arm was 64 in. long and 4 in. wide, with the 
exception of the first 9 in. which was 6 in. 
wide and served as the start box. The short 
arm was 14 in. long and 6 in. wide and 
constituted the goal box. A black metal 
guillotine door separated the start box from 
the running alley and a similar door located 
39.5 in. down the running alley marked the 
beginning of the goal box. A photoelectric 
cell, activated by a light beam passing through 
an infrared filter, was situated in the side wall 
of the running alley 1.5 in. above the floor 
and 2.5 in. from the start-box door. A second 
photoelectric cell, activated by a similar light 
source, was located down the running alley 
42 in. from the first photoelectric cell. A 
metal drinking tube containing a x-in. fluid 
hole projected through a 3.in. hole cut in the 
far wall of the goal box 3 in. above the floor. 
On nonreinforced trials a 6 X 9.5 in. black 
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false wall of }-in. plywood covered the hole in 
the goal-box wall. On “blocked response 
trials" a 4 X 9.5 in. black wooden partition 
was inserted into the running alley ap- 
proximately 1 in. from the start-box door. 
The raising of the start-box door started a 
.01-sec. Standard Electric timer. This timer 
stopped and a second similar timer started 
when S interrupted the first light beam. The 
second timer stopped when S passed through 
the second light beam. The timers registered 
starting and running times, respectively. 


Procedure 


Taming and habituation —Ten days prior 
to pretraining, Ss were placed on a 2023.5 hr. 
water-deprivation schedule, which consisted 
of 3 hr. access to water daily and which was 
maintained throughout the experiment. Each 
S was given 2-4 min. of gentling daily on at 
least 5 of the 10 days prior to pretraining. 
On the fourth and third days prior to pre- 
training, thirsty Ss were placed in the alley- 
way goal box (goal-box door closed) for 5 min. 
with water present, 

Pretraining.—Each S was given five pre- 
training trials. A trial consisted of placing .S 
in the start box, raising the start-box door 
after a 15-sec. delay, lowering the start- and 
goal-box doors after .S passed under them, and 
allowing S 15 sec. access to water in the goal 
box. "The intertrial interval was approxi- 
mately 45 sec. and each rat was given } hr. 
access to water approximately 3 hr. after the 
last trial of the day. 

Measures of each S's activity were ob- 
tained during the 15-sec, period of delay in the 
start box by means of a modified time- 
sample method (Bindra & Blond, 1958). On 
each trial, five samples of a start-box activity 
were recorded. At the end of each 3-sec. 
interval, the “on-going” activity was catego- 
rized and recorded as either grooming (G), 
sitting (L), sitting-sniffing (L-S), rearing (R), 
rearing-sniffing (R-S), or locomotion (W). 
These categories have previously been de. 
fined (Bindra, 1963a; Bindra & Blond, 1958). 

Acquisition training. —F ollowing pretrain- 
ing, Ss were randomly assigned tofour groups, 
11 per group, for 6 days of acquisition train- 
ing. Group P received 20 trials per day on 
a 50% reinforcement schedule randomized 
within blocks of 20 trials with the restrictions 
that the first and the last trials of each block 
be rewarded and that runs longer than 3 
rewarded or 3 nonrewarded trials be excluded. 
Group B received 20 trials per day with 50% 
of the trials being blocked response trials. 
On these trials the start-box door was opened 
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after the 15-sec. delay but the inserted parti- 
tion prevented entry into the running alley: 
The S was removed from the start box 15 sec. 
after the door was opened. The ordinal 
position of the blocked trials corresponded to 
nonrewarded trials for Group P. Groups Ci 
and Cs received 10 and 20 trials per day, re- 
spectively, on a 100% reinforcement schedule. 
These two continuously reinforced groups 
were included in the experimental design as 
Group B Ss received only 10 opportunities to 
run to the goal box out of 20 daily trials. All 
other running and recording procedures were 
similar to those employed during pretraining 
with the exception that start-box activity was 
recorded only on the first 5 and last 5 trials 
of each day. 

Extinction.—Extinction trials began the 
day following the completion of acquisition 
training and continued for 3 days. All Ss 
received 20 nonreward trials per day provided 
they did not meet the extinction criterion of 
failing to leave the start box within 2 min, or 
to enter the goal box 2 min. after leaving the 
start box on three successive daily trials. 
Extinction trials were terminated for that day 
when S fulfilled the criterion, and start- and 
running-time scores of 120 sec. were assigned 
to the uncompleted trials. Other running and 
recording procedures were similar to acquisi- 
tion training. 


RESULTS 
Start-Box Activity 


The frequency of occurrence of each 
category of start-box activity was 
determined for each S for the pre- 
training trials and for each daily block 
of training and extinction trials. The 
daily frequency measures were con- 
verted to proportions by dividing the 
frequency measure by the total num- 
ber of daily observations on each S. 
The proportions were transformed to 
arc-sine scores as suggested by Mos- 
teller and Bush (1954) and the arc- 
sine scores for each of the six cate- 
gories of start-box activity were 
subjected to separate analyses of 
variance, 

The results of the analyses (Table 1) 
indicate that reliable group differences 
occurred only for the activities of R-S 
during acquisition and R-S and L-S 
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TABLE 1 
ANALYSIS OF VARIANCE ON THE ÀARC-SINE TRANSFORMATIONS OF FREQUENCY OF OCCURRENCE 


OF START-BOX ACTIVITIES DURING PRETRAINING, ACQUISITION, AND EXTINCTION 


F 
Source df 
L R G LS R-S w 

Pretraining 

etween 

Groups (G)| 3 0.68 0.72 2.11 0.16 0.25 1.42 

Error (MS) | 40 | (156.8) (66.5) | (44.7) (68.1) (48.3) (53.1) 
Acquisition 

etween 

G 3 2.35 1.01 1.99 2.30 4.29* 0.52 
weiter (ars)} 40 | (259.8) (101.1) | (230.3) | (235.8) | (269.5) (166.1) 

thin 

Days (D) 5 10.91** 0.90 4.06** 145 2.49* 0.84 

GXD 15 1.17 1.59 | 1.18 1.63 0.54 0.46 

Error (ALS) |200 | (43.3) (20.0) | (29.1) (38.2) (39.4) (28.0) 
Extinction 

etween 

G 3 0.75 0.97 1.01 3.20* a7" 0.16 
waiter (MS) | 40 | (298.5) (16.5) | (274.7) | (164.0) | (195.3) (89.6) 

"ithi 

D 2 2.28 2.93 &97*| 163 4.39** 2.28 

GxDp 6 0.43 1.64 0.73 1.76 1.23 0.77 

Error (M.)| 80 (95.2) (9.9) (43.5) (45.2) (44.2) (32.1) 

* 

P s 

"5 i2 


during extinction.  Scheffe's test for 
hultiple comparisons (Edwards, 1960, 
a 154—156) on the overall mean ac- 
luisition R-S scores indicates that 
ferr B Ss did significantly less 
Ting-snifüng than Group C» Ss 
Sony 05). Other group compari- 
(b's Were not statistically reliable 
Cari > .05). Similar comparisons 
tea Out separately on the overall 
even extinction R-S and L-S scores 
nig led that Ss in Group B did sig- 
in qnd less rearing-sniffing than Ss 
an te other three groups (p's < .05) 
Si&nificantly more sitting-sniffing 
roup P Ss (p < .05). All other 
Þarisons proved to be nonsignifi- 
T (P's > .05). . 
liabi start-box activities showed 
fleet c 
has S- 


E 
t 
Co: 
Ds 


Te 
hanges across trials (Day 
There was a tendency for 
to increase as a function of 


acquisition trials and decrease as a 
function of extinction trials. The 
frequency of occurrence of L steadily 
declined during acquisition but was 
not reliably affected by extinction. 
Grooming increased during early ac- 
quisition and then showed a further 
increase during extinction. 


Instrumental Response 


Pretraining.—Start and running 
times were averaged separately across 
the five pretraining trials for each S. 


: Analyses of variance on the mean time 


scores indicated no statistically signifi- 
cant differences between the four 
groups for either start time, F (3, 40) 
= 0.04, p > .05, or running time, 
F (3,40) = 1.46, p > 0.5. 
Acquisition training.—Mean start- 
time and running-time scores were 
computed for each daily block of trials 
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MEAN RUNNING TIME VARIABILITY SCORES 
FOR THE FOUR EXPERIMENTAL GROUPS 
DURING EARLY AND LATE ACQUISITION 


Early. Late. 

Groups Acquisition Acquisition 
10.80 9.77 
& 28.48 6.59 
B 15.25 4.57 
P 35.69 2.97 


(20 trials per block for Groups C; and 
P and 10 trials per block for Groups 
C, and B). Analyses of variance in- 
dicated a reliable Trial Block effect 
for both start time, F (5, 200) — 15.25, 
p «.01, and running-time means, 
F (5, 200) = 15.87, p «.01. How- 
ever, the Group effect was not signifi- 
cant for either start time, F (3, 40) 
= 1.21, p> .05, or running time, 
F (3,40) = 0.81, p > .05; nor was 
the Group X Trial Block interaction 
significant for start time, F (15, 200) 
= 1.10, p> .05, or running time, 
F (15, 200) = 0.80, p > .05. In gen- 
eral, although performance of the four 
groups improved with trials, there 
were no differences in performance 
between the four groups when re- 
sponse times were averaged across 
blocks of daily trials. 

To determine whether the variables 
of nonreward and blocking increased 
response variability, the ranges of the 
start times and running times for the 
first 20 (early acquisition) and last 20 
acquisition trials (late acquisition) 
were determined for blocks of 5 trials 


and averaged to yield early and late: 


acquisition variability scores. Anal- 
yses of variance revealed that animals 
were considerably less variable during 
rly acquisition 
late as compared to ear 
yoth start-time, F qd, 40) = 20.16, 
jx OL, and running-time measures, 
f; (1,40) = 18.71,p < JT. The only 
indication of a difference in variability 


for | 
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between the four groups was the 
significant Group X Early-Late Ac- 
quisition interaction (Table 2) for the 
running-time measure, / (3,40) = 3.21, 
p «.05. Scheffe's test for multiple 
comparisons showed no reliable differ- 
ences (p's > .10) between any of the 
groups during late acquisition. Only 
the P vs. C; difference was significant 
(p < .05) among the early acquisition 
comparisons of pairs of means. Ex- 
amination of the group means in Table 
2 and the significant C,-- P vs. 
C; + B comparison (p < .05) suggests 
that response variabilityis greater dur- 
ing early acquisition for the two groups 
receiving 20 daily trials (Cs and P). 
Extinction.—Start times and run- 
ning times were averaged across blocks 
of 20 extinction trials for each S and 
the trial-block means were subjected 
to analyses of variance. The results 
of the analysis on start time showed 
a statistically reliable Group effect, 
F (3,40) = 4.32, p < .01, and Trial 
Block effect, F (2, 80) = 10.09, p < .01, 
but a nonsignificant Group X Trial 
Block interaction, F (6, 80) — 0.77, 
p 2.05. A similar analysis yielded 
a significant Group effect, F (3, 40) 
= 6.43, p < .01, and Trial Block 
effect, F (2, 80) = 13.34, p < .01, but 
a nonsignificant interaction, F (6, 80) 
= 1.04, p> .05, for running time. 
Figure 1 presents mean response times 
over blocks of 20 trials for the four 
groups. Both start- and running-time 
measures increased across trial blocks 
for all groups. Examination of the 
individual group performance curves 
reveals that there was little difference 
between Groups C; and C2 or between 
Groups B and P. Scheffe's test in- 
dicated that these comparisons were 
not statistically reliable for either the 
overall start- or running-time means 
(p's > 10). However, the C, + C» 
vs. B + P comparisons were significant 
for both time measures (p’s < .05). 
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The blocked and partially reinforced 
Ss showed greater resistance to ex- 
tinction than the continuously rein- 
forced Ss. 

Visual inspection of the group 
performance curves also suggests that 
Group B Ss showed somewhat less 
initial resistance to extinction but 
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better terminal extinction perform- 
ance than Group P Ss. Analyses of 
variance were carried out on five-trial 
start- and running-time means for the 
Group B and P data only. Although 
the Group X Trial Block interaction 
was not reliable for start time, 
F (11, 220) = 1.07, p > .05, the sig- 


3 


BLOCKS OF 20 TRIALS 


Fiki 


Mean start and running ti 
Stoups, i 


bs (Block 0 indicates the 
ks 1-3 the 3 extinction days-) 


mean start or run 


me across blocks of trials for the four experimental 


ning time on the last day of acquisition and 
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nificant Group X Trial Block inter- 
action for running time, / (11, 220) 
= 2.78, p < .05, provides some sup- 
port for the above suggestion. 


DISCUSSION 


The changes in occurrence of groom- 
ing, sitting, and rearing-sniffing in the 
start box across acquisition and extinc- 
tion trials agree with data reported by 
Bindra (1963a) with one exception. 
Bindra found that the duration of time 
spent sitting in the start box decreased 
across extinction trials; whereas, in the 
present experiment, the frequency of 
occurrence of sitting did not show any 
reliable change during extinction. This 
discrepancy in findings may be due to the 
different methods of measuring start-box 
activity used in the two studies. 

Bindra (1963b) has presented evidence 
suggesting that in a lever-pressing situa- 
tion irrelevant activities are related to 
such phenomena as the partial-reinforce- 
ment effect. He contends that in an 
alleyway situation the effect of goal-box 
events such as nonreward is to alter 
subsequent behavior in the start box 
which in turn affects the instrumental 
response (Bindra, 1963a). The absence 
of any difference in start-box activities 
between partially and continuously rein- 
forced Ss, together with the presence of 
a clear difference in instrumental re- 
sponse times between these groups during 
extinction, does not support Bindra's 
contention. However, the finding that 
blocking did alter start-box behavior, 
particularly rearing-sniffing, suggests that 
the relation of start-box activity to ex- 
tinction phenomena is in need of further 
clarification. g " . 

Of special theoretical interest is the 
finding that the blocked response condi- 
tion resulted in a marked increase in 
resistance to extinction; almost, if not as 
great, as that due to partial reinforce- 
ment. Lawrence and Festinger (1962) 
propose that in an alleyway situation 

in events such as nonreward cause 
cane in the rat. On nonrewarded 
em rn trials partially reinforced Ss 
decim "extra attractions'' in the goal 
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box which reduce dissonance and, in 
turn, increase resistance to extinction. 
Blocked Ss never experienced nonreward 
in the goal box during acquisition and 
hence should not have developed any 
extra goal-box preferences. The in- 
creased resistance to extinction following 
blocking of the instrumental response 
would not seem compatible with disso- 
nance theory as developed so far. 

On the other hand, the extinction 
findings can be accounted for within 
the framework of frustrative-nonreward 
theory of Amsel (1958, 1962) if it is 
assumed that blocking, like nonreward, 
elicits frustration and, further, that a 
secondary form of frustration similar to 
the fractional anticipatory frustration 
reaction (rj) can work forward in the 
alleyway through stimulus generaliza- 
tion. If this be the case, both blocked 
and partially reinforced Ss would have 
become conditioned to run to cues of 
frustration (s;) associated with »n dt 
Should be noted that P and B animals 
did not exhibit significantly greater re- 
Sponse variability during early acquisi- 
tion than respective C Ss with an equal 
number of daily trials as predicted by 
frustration theory, Subsequent analysis 
suggests that increased variability due to 
frustration may have been obscured by 
factors associated with closely massed 
trials. 

The present study yields at least three 
kinds of information. First, it provides 
empirical evidence that the blocking of 
an instrumental response during acquisi- 
tion increases its resistance to extinction. 
Secondly, it casts considerable doubt 
upon the utility of the cognitive-disso- 
nance interpretation of extinction effects. 
Thirdly, it suggests that, in arriving at 
an explanation of extinction phenomena, 
one should not limit oneself only to 
events occurring in the goal box. Events 
occurring in other parts of the alleyway 
may be equally important. 
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This experiment investigates the relation between subjective probability 
and mathematical probability. Bayes’ theorem provides the correct 
revision of probabilities as a result of new data. New data, however, 
may come in different sized samples. Th experiment measured the 
accuracy of subjective probability revision as 


The results show that accuracy decreases as 
function is negatively accelerated. 
essed by increasing the sample size is 


Behavioral decision theory provides 
a framework for evaluating the extent 
to which selected behavior of human 
Ss corresponds with ideally consistent 
behavior as outlined by statistical 
decision theory (Edwards, 1961). The 
two primary variables of decision 
theory are probability and value; the 
corresponding psychological variables 
are subjective probability and utility. 
The present experiment studies the 
correspondence between subjective 
probability and mathematical prob- 
ability. 

Recent developments within sta- 
tistical decision theory focus on the 
correct revision of probabilities in the 
light of new information—the problem 
of revising the probability of a hy- 
pothesis as a function of the occur- 
rence of a relevant datum (e.g., see 


1'lhe research reported here was under- 
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a function of sample size. 
sample size increases; the 


The gain in amount of data proc- 


at the expense of accuracy. 


Schlaifer, 1961; or Raiffa & Schlaifer, 
1961). These developments revolve 
around the concepts of personal prob- 
ability and Bayes' theorem. Personal 
probabilities are ideally consistent 
opinions, and conform to the axioms of 
probability theory; subjective proba- 
bilities are the opinions of real persons, 
and may or may not conform to the 
axioms of probability theory. Bayes’ 
theorem is a derivation from the 
axioms of probability theory. A con- 
sequence of Bayes' theorem which is 
relevant to this paper is 


PQL|D) | P(D| Hs) P(H.) 
PD)  P(D|H, PER) 


or, more simply, 
Qı = LQ% [2] 


where Ha and H, refer to two different 
hypotheses regarding the state of the 
world, and D refers to a relevant 
datum which occurs. The Qo refers 
to the odds of H, to Hy prior to the 
occurrence of D, The Q; refers to the 
corresponding odds posterior to the 
occurrence of D. The L refers to a 
likelihood ratio; it is equal to the 
conditional probability of D given Ha 
divided by the conditional probability 
of D given H, Thus Equation 2 
specifies that the revision of the odds 
of Ha to Hy, is a function of the occur- 
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rence of D. This revision is accom- 
plished by multiplying the prior odds 
by the likelihood ratio to yield the 
posterior odds (for a detailed discus- 
sion of Equation 2 see Edwards, 
Lindman, & Savage, 1963, p. 218). 
Experiments carried out by Ed- 
wards and his associates indicate that 
Ss revise their subjective probabilities 
in the correct direction, but that the 
magnitude of revision is less than that 


specified by Bayes’ theorem (e.g 
Edwards & Phillips, 1963, 1964). 


Typically, in these experiments, S's 
task has been to revise his subjective 
probability about which H was gen- 
erating a sequence of data. A revision 
was made after the presentation of 
each datum in the sequence. f 
Information which results in à 
revision of subjective probabilities 
need not be restricted to a single 
datum at a time. Information may be 
provided in a sample of data. But 
more data in a sample implies more 
information processing. For some 
types of human information process 
ing, an increase in the amount of 
information presented to an S impairs 
his accuracy; for other types of in- 
formation processing, the chunking of 
More information into a presentation 
does not impair accuracy (Miller, 
1956). The primary purpose of the 
Present experiment was to investigate 
the function relating accuracy of data 
Processing to the size of the sample. 
Accuracy was measured by the degree 
to which Se’ revision of subjective 
Probabilities agreed with the Simus 
revision. Sample size referred to the 
Specific number of data in the sample 
Which provided the information for 
the revision of subjective probabilities. 


METHOD 


i ET -nerimental 

Experimental design7 The ope per. 
ism required the manipulation € ct upon 
ize and an evaluation Of sinn 
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accuracy in the revision of subjective proba- 
bilities. Urns with varying proportions of 
black marbles and white marbles were the Hs. 
Each trial consisted of a random selection of 
48 marbles (Ds) from an urn. After each 
trial Ss were shown which urn yielded the 
marbles. 

Subjccts.—Forty-four volunteer students 
from an introductory psychology course 
served as Ss in groups of four. 

Apparatus.—Two large beakers served as 
urns. Each urn contained approximately 
1,000 marbles of 3 in. diameter. Large num- 
bers of marbles were used so that, with con- 
tinual mixing, sampling without replacement 
could be considered a reasonable approxima- 
tion of sampling with replacement. Urn B 
contained three black marbles for every two 
white marbles; Urn W contained two black 
marbles for every three white marbles. In 
this symmetrical situation the ratio of most 
probable to least probable color was a con- 
stant 60:40 for both urns. ‘This relatively 
uncertain ratio insured that Ss were not re- 
quired to make too many revisions near a 
ceiling of certainty, ie., subjective proba- 
bilities of 1 or 0. 

Four sample sizes were selected: 1, 4, 12, 
and 48. Relatively more small sizes were 
included because of the expectation that the 
function wotld be negatively accelerated. 
For samples of size 1, E drew single marbles 
ata time. Dippers were made which would 
“dip” the appropriate number of marbles for 
each sample. 

The response apparatus for obtaining sub- 
jective probability estimates was a 25-in. bar 
marked off into 100 equal units which were 
numbered from left to right. A sliding marker 
permitted S to divide the bar into two sec- 
tions. The length of the left section repre- 
sented S's subjective probability that the 
marbles observed were drawn from Urn B; 
the length of the right section represented S’s 
subjective probability that the marbles were 
drawn from Urn W. The length of the entire 
rod represented a probability of 1, so sub- 
jective probabilities for both urns were 
required to sum to 1. The number under 
the sliding marker represented the subjective 
probability of Urn B. In order to preclude 
problems inherent in responses of complete 
certainty, subjective probabilities were coded 
as no more extreme than .001 or .999. 

Black drapes divided one side of the ex- 
perimental room into four cubicles, each with 
an opening facing E, This permitted four 
individual Ss to be run simultaneously ; but 
] atmosphere approximated 


the experimenta. 
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the running of individual Ss in terms of both 
communication between Ss and rapport 
É or experiment consisted of 
eight blocks of four trials each. Each trial 
consisted of one of the four sample sizes. 
During each trial E drew and displayed 
successive samples of marbles until the 
cumulative number of marbles drawn num- 
bered 48. After each sample S was required 
to revise and record his subjective probability 
that E was drawing from Urn B. In order to 
avoid confounding the effects of sample size 
and within-block learning, the sample sizes 
were counterbalanced with respect to order of 
occurrence during a block. 

The Ss, four or fewer, were seated in 
cubicles facing E. A table in front of each S 
contained a probability bar and an answer 
sheet. The E instructed Ss with respect to 
the content of the urns and the use of the 
probability bar. For example, if S expected 
that it was twice as likely that E was drawing 
from Urn B rather than from Urn W, he 
would set the marker on the probability bar 
so that the Urn B side was twice the length 
of the Urn W side, The Ss were told that the 
purpose of the experiment was to compare 
their intuitive revision of subjective proba- 
bilities with the correct revision as calculated 
by means of a mathematical equation. There 
was a practice demonstration of the step-by- 
step procedure. 

The procedure of each trial progressed 
according to the following steps. (a) The E 
flipped a coin and selected the designated urn. 
(b) The E placed the urn into an opaque box 
so that Ss were not able to identify the urn. 
(c) Each S set the marker on the probability 
bar at .50. (d) The E drew at random a 
prescribed sample size of marbles and dis- 
played the sample to Ss, (e) On the basis of 
the sample, each EA reset his probability 
marker and recorded his subjective probability 
of Urn B on his answer sheet. This sequence 
of draw-reset marker-record was repeated 
until E had drawn 48 marbles from the urn. 
(f) The E then showed Ss which urn he had 
been drawing from. . He then progressed to 
the next trial by flipping another coin to 
select the next urn. This procedure con- 
tinued until the end of the i rita à 

Measure of probability revision.— n order 

to compare the subjective probability irn 
of Ss with the corresponding probabi ity 
revision as calculated by means of Bayes 
theorem, the data analyses required a see 
measure of revision. The measure used here 


was the same as that reported by Edwards 


and Phillips (1963). It is the logio of the 
likelihood ratio (LLR). The likelihood ratio 
is described above in Equation 2; it is equal 
to the ratio of the posterior to the prigr odds. 
Thus, the LLR of this experiment was equal 
to the log of the posterior odds in favor of 
Urn B minus the log of the corresponding 
prior odds. Since the LLR increases with the 
difference between the posterior and prior 
odds, it must also increase with the difference 
between the posterior and prior probabilities. 
It is thus a measure of revision from prior to 
posterior subjective probabilities; the meas- 
ure is positive if the revision is upward and 
negative if the revision is downward. 

The LLRs for the present experiment were 
calculated as follows. Posterior probabilities, 
both subjective and Bayesian, were obtained 
after each trial of 48 marbles. These posterior 
probabilities were converted into odds in 
favor of Urn B, and were then transformed 
into log posterior odds. The definition of 
LLR calls for a subtraction of log prior odds 
from log posterior odds. However, since the 
initial probability of Urn B on each trial was 
-50, the prior odds were always 1 and the log 
Prior odds were 0. Therefore, the log posterior 
odds were equivalent to LLRs. An LLR 
calculated from Bayesian Probabilities will be 
termed BLLR: bne calculated from subjective 
probabilities will be termed SLLR. The 
BLLR in the Present experiment was equal to 
-17609 times the total number of black 
marbles minus white marbles, 

An important advantage of the LLR over 
the algebraic change in Probabilities as a 
measure of probability revision is that the 
LLR is Proportional to the amount of evidence 
in favor of one hypothesis over the other. 
Algebraic change in probabilities, on the other 
hand, is influenced bya ceiling of certainty; 
equal increments of evidence result in smaller 
expected algebraic Probability changes near 
certainty than near uncertainty, 

Relation of SLLR to BLLR.—Two depend- 
ent variables related SLLRs to corresponding 
BLLRs. The Primary dependent variable, 
the accuracy ratio, was the ratio of SLLR to 
BLLR at the end of each trial of 48 marbles. 
If S changed his Probabilities in the same 
algebraic magnitude as did Bayes’ theorem, 
then his accuracy ratio would be 1. To the 
extent that S Changed less than Bayes' 
theorem, his accuracy ratio decreased, 

A second dependent variable, linearity, 
was the correlation between corresponding 
within-trial BLLRs and SLLRs. The LLRs 
were based on Probabilities posterior to each 
sample, i.e., posterior to each probability 
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revision. A correlation was calculated for 
each trial of sample sizes 1 and 4; there were 
too few within-trial LLRs for sample sizes of 
12 and 48, 


RESULTS 


Analyses of variance—In order to 
evaluate the significance pf the effect 
or sample size upon the accuracy of 
Subjective probability revision, anal- 
Yses of variance were applied to the 
accuracy ratio and to linearity. The 
two main effects for these analyses 
Were sample size and blocks of trials. 

hese analyses of variance are pre- 
Sented in Table 1. Whenever the 
Dumber of black and white marbles 
Was exactly equal, BLLR had a value 
9f 0 and thus the accuracy ratio was 
Indeterminate, As a constantly con- 
Servative procedure, each indetermi- 
nate value was replaced by the grand 
Mean of the data in the analysis o 
Variance and the degrees of freedom 
associated with the highest order 
“ror term were reduced by one. — , 

‘able 1 indicates that sample size 
Pi a significant (P € .001) effect 
Pon the accuracy ratio. No other 
Sie atio reached the .01 level of 
"IBnificance. 
ds escription of effects —Table 2 pre- 
nts the mean value of each depend- 


SIZE 


Mean accuracy ratio as a function of sample size. 


ent variable associated with each level 
of the independent variables. Figure 
1 illustrates the function relating 
accuracy ratio to sample size. This 
ratio decreases rapidly at first, and 
then more slowly, as sample size 
increases. 

Although the LLR is the more ade- 
quate measure of probability revision, 
Fig. 2 further illustrates the effect of 
sample size by a more direct measure 
— absolute probability change. The 
horizontal axis indicates the absolute 
difference between black and white 
marbles at the end of each trial. The 
vertical axis indicates the net absolute 
revision from the prior probability to 
the posterior probability at the end of 


TABLE 1 


ANALYSES OF VARIANCE ON ACCURACY 
RATIO AND LINEARITY 


Accuracy Ratio Linearity 
Source 
af F df F 
Sample Size 
(A 3/129 | 13.72*** 1/43 | 4.04 
Blocks (B) 7/301 | 1.32 7/301| 2.57* 
AXB 21/903 | 1.78 7/301) «1 
*p <.05. 
+*+ p < 001. 


on 
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TABLE 2 
MEAN VALUE FOR EACH DEPENDENT 
VARIABL ZACH LEVEL OF THE 
NDENT VARIABLES 
Accuracy Ratio Linearity 
Sample Size 1 470 -915 
Sample Size 4 361 .947 
Sample Size 12 53 — 
Sample Size 48 255 — 
Block 1 .349 .929 
Block 2 .342 .910 
Block 3 327 951 
Block 4 305 .917 
Block 5 .283 .943 
Block 6 .328 .917 
Block 7 363 .940 
Block 8 .384 .944 


the trial. The solid curve represents 
the correct revision as calculated by 
means of Bayes’ theorem. The three 
dotted lines are smooth curves in- 
tended to represent the data points 
associated with sample sizes of (a) 1, 
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(b) 4, and (c) 12 and 48. The observa- 
tion base varied across data points, 
and was relatively small when abscissa 
values were more extreme than 2 and 
22. Figures 1 and 2 tell the same 
Story. Subjective probability re- 
vision lagged behind Bayesian rce- 
vision; amount of revision decreased 
from sample size 1 to 4, decreased from 
4 to 12, and remained constant from 
12 to 48. 

Means and individual differences.— 
The first two rows of Table 3 present 
the mean and the standard error of 
the mean for the accuracy ratio and 
linearity. In order to evaluate the 
consistency of individual differences 
across situations, scores were com- 
pared among all levels of sample size. 
By collapsing over blocks, one score 
was obtained for each Sin each sample 
size. Correlating each pair of sample 
sizes across Ss resulted in a matrix of 
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—— Boyes Theorem 
o Sample Size | 
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Net absolute probability change as a function of the absolute difference between 
(A verticle line through a solid data point indicates 
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six correlations. The average of these 
six correlations is presented in the 
bottom row of Table 3, indicating the 
extent to which an S's performance in 
one sample size could be predicted by 
his performance in another sample size. 
With df = 42, a correlation of .38 is 
significantly different ($ < .01) from 
Zero. 


DiscussION 


The primary purpose of the present 
experiment was to investigate the shape 
of the function which relates accuracy in 
the revision of subjective probabilities to 
the size of the sample of data upon which 
the revision is based. The graphs of Fig. 
land 2 answer this question. Accuracy 
of subjective probability revision de- 
creases with a negatively accelerated 
slope as sample size increases. 

With reference to the information proc- 
essing findings discussed earlier, these re- 
Sults provide another instance in which 
the accuracy of human information trans- 
Mission decreases as the amount of in- 
formation to be processed per observa- 
tion increases. However, even though 
accuracy decreases, an increase in sample 
Size results in more data processing per 
observation, Changing from sample size 
48 to sample size 1 in the present experi- 
ment resulted only in à doubling of 
accuracy at the cost of 48 times as many 
Observations. Thus, greater revision per 
observation results from increasing sam- 
Dle size, but at a cost in accuracy. 

, The lack of effect of any variable upon 
linearity means that 5$ in the present 
Experiment were equally linear xim 

ayes’ theorem, regardless of the level [n 
Sample size or block. Of course, caution 
1$ to be used in the interpretation of this 
meaning of linearity: Any variance 1n 
Subjective probability revision which was 
Not accounted for by Bayes’ theorem may 

e due either to a systematic nonlinear 
relation or to error variance around a 
Straight line. However the relatively 
Consistent magnitude of correlations re- 
Ported in Table 2 (all mean correlations 
in the table fall between and 95) 
implies an important degree of linearity 


TABLE 3 
Mean, VARIABILITY, AND Consist 
MEASURES ACROSS SS FOR THE 
Accuracy RATIO AND 
LINEARITY 


ENCY 


Accuracy Ratio Linearity 
M 334 .933 
9M .039 .009 
Consistency 075 518 


between Bayes’ theorem and the revision 
of subjective probabilities. 

Finally, the experimental results pro- 
vide further support for the previous 
finding that the revision of subjective 
probabilities lags behind the correspond- 
ing optimal revision as calculated via 
Bayes’ theorem. Fairly high linearity 
seems to characterize this relation. Some 
Sslag much more than others; the degree 
of lag is significantly consistent from one 
sample size to another. Some Ss are 
more Bayesian than others. 
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EFFECTS OF POSTRESPONSE STIMULUS DURATION 


UPON DISCRIMINATION LEARNING 
IN HUMAN SUBJECTS? 


DONALD J. DICKERSON axp NORMAN R. ELLIS 
George Peabody College for Teachers 


3 experiments, designed to explore temporal relations in instrumental 
conditioning, are reported. The variable manipulated was postresponse 
stimulus duration (PSD), the length of the interval from S's response to 
the offset of the discriminanda. The tasks were 2-choice discrimination 
problems. In a ist experiment PSDs were 0.0, 0.5, 1.0, and 2.0 sec. 
Normal and retarded human Ss were assigned to each PSD group. A 
2nd experiment was similar to the 1st except the duration of the rein- 
forcing light was changed from 1.0 sec. to 3.0 sec. A 3rd experiment 
differed in several ways from the 1st 2. The cues for the task were 
different, normal Ss only were used, a buzzer instead of the light served 
as a reinforcer, and the PSDs were 0.0, 0.2, 0.5, 0.75, and 1.0 sec. 
main findings show that PSD has a marked effect upon discrimination 
learning of the type studied. The optimal interval was 0.5 sec. 
of shorter and longer durations led to slower learning. Normal Ss 
learned faster than retarded Ss, although no interaction effects prevailed. 


Temporal relations between events 
in classical conditioning have received 
extensive study, and the main findings 
have been summarized by Kimble 
(1961). On the other hand, temporal 
aspects of instrumental conditioning 
have received little experimental at- 
tention. The most effort has focused 
on delay of reinforcement. Although 
numerous investigators have dealt 
with comparisons of simultaneous 
and successive discrimination learn- 
ing, such studies are complex and 
appear to involve variation along 
other dimensions. 

Recent experimental evidence in- 
dicates that variation of temporal 
relations between stimuli, responses, 
and reinforcements has marked effects 
upon discrimination learning. Chow 
and Orbach (1957) and Riopelle and 
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Addison (1962) have studied the 
effect of stimulus duration upon the 
rate of simultaneous discrimination 
learning in rhesus monkeys. In the 
main, these studies show that learning 
is facilitated by longer exposure to 
the discriminanda. Learning rate in- 
creases as stimulus duration increases 
from 0.1 sec., reaching an asymptote 
at around 1.0 sec, Sheridan, Horel, 
and Meyer (1962) report that the 
postresponse persistence of either the 
positive or negative cue produces 
learning superior to that observed 
when both or neither of the cues 
persist. 

The present experiments assess the 
effects of postresponse stimulus dura- 
tion (PSD) upon two-choice dis- 
crimination learning in human Ss. 
PSD is the temporal duration of the 
discriminanda following S’s response 
on both reinforced and nonreinforced 
trials. The reinforcing stimulus im- 
mediately follows correct responses. 
Perhaps in many learning situations 
cues are withdrawn with the response 


528 


POSTRESPONSE STIMULUS DURATION AND LEARNING 


and are contiguous neither with the 
reinforcement nor the terminal phase 
of the response. Thus, such a task 
somewhat parallels the trace classical 
conditioning model, and the formation 
of an association may depend upon a 
"memory trace" of the initial event. 
Ellis (1963) has adduced evidence 
suggesting that mentally retarded Ss 
have inadequate short-term memory 
processes. Therefore, such Ss may be 
more adversely affected by noncon- 
tiguity of events than are normal Ss. 


EXPERIMENT I 
Method 


A pparatus.— The Ss were tested inaS X8 
ft. cubicle. The S console was a metal 
cabinet, 16 X 19 in. and 12 in. high, with a 
top that sloped toward S. Two miniature 
stimulus projectors were mounted 2 in. apart 
in the sloping top. The viewing surfaces 
(frosted Plexiglas) of the projectors are round 
and approximately 1 in. in diameter. Any 
One of seven patterns or five colors or à com- 

ination of pattern and color may be dis- 
Played with each projector. A push-button 
type switch was located below each projector. 

green pilot-type lamp mounted 6 in. above 
the projectors in the console served as a rem- 
Orcing stimulus, The console was on à table, 
and S sat in a chair in front of the table. 
Electromechanical circuitry located in an 


adjoining room provided automatic control 
9r the intertrial interval, the onset of the 
Stimuli, the PSD, the recording of responses, 
and the occurrence and duration of the rein- 
Orcing stimulus. 
Subjects.—The Ss were 80 undergraduate 
PSychology students and 80 mildly retarded 
Students from a state residential facility. The 
As of the normal Ss ranged from 17 yr. 9 
Yr. with a mean of 20.6 yr. The psy ol 
the retarded ranged from 15 yr. to 2 m 
vith a mean of 16.6 yr. Their adl range 
rom 70 to 85 with a mean of 78.1. 
Pratar ihe stimuli were à guns. 
triangle, circle, vertical bar, and horizonta 
ar presented on backgrounds of blue, green, 
Yellow, red, and light gray» making à total of 
Stimuli, Five were selected arbitrarily as 
correct, and 1 of the 5 correct and 1 of Se 
tri Incorrect stimuli were presented on one 
ilt All 5 correct stimuli appeared ig eee 
as rial block, although incorrect stimu e 
Signed at random and did not appear con 
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The 
serial order and the position of correct stimuli 
varied randomly within trial blocks. The 
programing equipment provided for a series 
of 25 unique trials which was repeated without 
interruption until S either reached a criterion 
of 10 consecutive correct responses or com- 
pleted 200 trials. 

The S was seated before the panel and was 
told that he was to try to press the button 
under the "right" picture as many times as 
he could, that when he pressed the button 
under the "right" picture the light (rein- 
forcing) would come on, and that when he 
pressed the button under the “wrong” picture 
the light would not come on. The S was free 
to respond after the two stimuli appeared. 
The reinforcing stimulus immediately fol- 
lowed a correct response and was of 1.0 sec. 
duration. The discriminanda persisted fol- 
lowing both reinforced and nonreinforced 
responses. The PSDs were 0.0, 0.5, 1.0, and 
2.0 sec., and Ss from each of the two groups 
were assigned equally and without bias to 
these subgroups. The intertrial interval was 
a constant 10 sec. and was measured from the 
offset of the discriminanda on one trial to the 
onset of the discriminanda on the next. 


sistently with particular correct stimuli. 


Results 

Number of correct responses in 200 
trials was the dependent measure. 
The Ss reaching criterion were as- 
sumed to perform perfectly beyond 
the criterion. Figure 1 depicts mean 
percent correct responses for the 
eight subgroups. Performance of the 
normal Ss is superior to that of 
the retarded Ss, F (1,152) = 15.22, 
p < .001, and the PSD effect is also 
statistically significant, F (3, 152) 
= 6.25, p < .001. The latter effect 
is due to the superiority of the per- 
formances of the 0.5-sec. and 2.0-sec. 
subgroups over those of the 0.0-sec. 
and 1.0-sec. subgroups. No other 
differences reach statistical signifi- 
cance. The interaction between abil- 
ity level and PSD is not significant. 
Group curves indicate that the super- 
ior performance of the 0.5-sec. and 
2.0-sec. subgroups appears within the 
first 40 training trials and persists 
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throughout training. All subgroups 
show improvement over trials. 

These results are not in agreement 
with general expectations; perform- 
ance did not increase consistently with 
PSD, and there is an absence of an 
interaction effect. Perhaps the most 
notable effect is the elevated perform- 
ance in the 0.5-sec. subgroups. The 
performances of the 1.0-sec. 


yoorer 
ave resulted from the 


ay hi 
subgroups me rom th 
simultaneous offset of the discrimi- 


nanda and the reinforcing light at that 
PSD (ie., perhaps some interference 
es 


occurred). 


Performance of normal and retarded Ss at each of the PSDs in Exp. I. 


EXPERIMENT II 
Method 


Subjects —The Ss were 40 undergraduate 
psychology students and 40. retarded students. 
None had Participated in Exp. I. The re- 
tarded students were from a state residential 
school. The CAs of the normal Ss ranged 
from 18 yr. to 27 yr. with a mean of 19.4 yr. 
The CAs for the retarded Ss ranged from 15 
yr. to 20 yr. with a mean of 16.2 yr., and their 
IOs ranged from 55 to 85 with a mean of 67.2, 

Procedure.—'The Ss from each group were 
assigned to PSD subgroups of 10 Ss each. 
The PSDs were the same as those in Exp. I. 
The duration of the reinforcing stimulus was 
extended to 3 sec., and the intertrial interval 
was measured from S's response on one trial 
to the onset of the discriminanda on the next. 
Otherwise, the procedure was identical to 
that of Exp. I. 
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POST-RESPO 
Fic. 2. Performance of normal 
Results 


Figure 2 shows mean percent cor- 
rect responses in 200 trials for each 
of the subgroups. These results are 
similar to those of EXP- T. Perform- 
ance of the normal Ss is superior to 
that of the retarded Ss, F (1, 72) 
5n 18.90, p < .001, an 
is significant, F (3, 72) 
The ability level and 
is not statistically sig" 
differences again appear e 
ing and persist throughout. . 

The general form of the curves in 
Exp. I and II is quite similar. The 


NORMALS 69————6 


DEFECTIVES O————O 


1.0 2.0 


NSE STIMULUS DURATION (IN SEC.) 
and retarded Ss at each of the PSDs in Exp. Il. 


effect of the reinforcing stimulus offset 
is difficult to evaluate since there are 
some overall group differences be- 
tween Exp. I and II. However, it 
apparently produced little, if any, 
effect. Again, the most marked result 
is the superior performance of the 0.5- 
sec. subgroups. 


Exrerment HI 
Method 
Subjects —The Ss were 60 undergraduate 
psychology students. Their CAs ranged from 
17 yr. to 25 yr. with a mean of 19.2 yr. None 
of these Ss had participated in Exp. | and Il. 
Procedure.—Several procedural changes 
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were introduced to extend the generality of 
the results of Exp. I and II. The stimuli were 
six patterns with two identical patterns ap- 
pearing on each trial, one on a red background 
and the other on green. For three of the six 
patterns, green background was Correct, and 
for the other three, red was correct. Each 
pattern appeared in each 6-trial block. Back- 
ground color was assigned at random to posi- 
tion, and also serial. order of pattern was 
assigned randomly. The Programing equip- 
ment provided for a series of 52 trials which 
could be repeated without interruption. 

The reinforcing stimulus was a buzzer 
sounding for 3 sec. The intertrial interval 
$ constant 8 sec., and was measured as in 
Exp. II. The PSDs were 0.0, 0.2, 0.5, 075 
ed ET Ten Ss were assigned without 
E to each of the five groups. Each S was 
n d to a criterion of 11 correct responses 
gue ny dirti 6-trial blocks or for 102 
1n 


trials. 
Results 


Figure 3 presents percent poore 
responses in 102 trials for the five 


Performance of Ss at each of the PSDs in Exp. III. 


groups. It is clearly apparent that a 
PSD of approximately 0.5 sec. yields 
optimum performance within the time 
range studied. An analysis of vari- 
ance of the data yields à significant 
PSD effect, F (4,55) = 6.02, p < .001, 
and ¢ tests show that the 0.5-sec. and 
0.75-sec. groups are Superior to the 
others. This difference again appears 
early in training and persists through- 
out. No other differences between 
Sroups are significant. 


Discussiox 


These results demonstrate fairly clearly 
that discrimination learning of the type 
studied is substantially influenced by 
PSD. It appears that a PSD of 0.5 sec. 
yields optimum performance and that 
longer intervals yield poorer performance 
up to approximately 1.0 sec. after which 
it may again increase. The main find- 
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ings of Sheridan et al. are not in conflict 
with those of the present experiments 
since these Es found no difference in 
learning between conditions involving 
PSDs of 0.0 and 2.0 sec. 

The finding that a PSD of 0.5 sec. 
yields optimum performance is reminis- 
cent of a similar temporal effect deriving 
from studies of the interstimulus interval 
in classical conditioning. An inter- 
stimulus interval of around 0.5 sec. yields 
optimum conditioning (Kimble, 1961). 
However, the relationship between the 
optimum PSD and the optimum inter- 
stimulus interval may be fortuitous since 
in the present experimental situation the 
interval between the onset of the CS 
(discriminanda) and the onset of the 
UCS (reinforcing light) is determined by 
the latency of S's response. 

Perhaps a number of hypothe 
be summoned to account for th i 
None seem particularly parsimonious nor 
adequate, However, a notion recently 
expressed by Miller (1963) may prove of 
value in considering the effects of rela- 
tively minute temporal variations 1n 
learning tasks. Itis conceivable that the 
“perception” of a stimulus involves a 
neural change which requires a certain 
amount of time. The occurrence of 
another stimulus event while the first is 
being “processed” may interfere. Such 
Speculation would suggest a more careful 


ses could 
e results. 


z23 


2923 


analysis of such phenomena as delay of 
reinforcement. Perhaps immediate rein- 
forcement is detrimental to the learning 
process in comparison with slightly 
delayed reinforcement. 
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SUPPLEMENTARY REPORTS 


STIMULUS DURATIONS AND TOTAL I 


EARNING TIME 


IN PAIRED-ASSOCIATES LEARNING ! 


CALVIN F. NODINE 


Miami University 


The present report supplements an earlier investigation concerned with 
the role of exposure durations of stimulus and response members on the 


acquisition of a paired-associ 


tes list using the anticipation method. 


Previously, Ss were given a fixed number of learning trials, and exposure 
durations of stimulus members alone (St duration) and exposure dura- 
tions of stimulus and response members paired (St-R duration), were 


found directly related to rate of learning. 


In the present experiment, 


Ss’ performance was equated by employing a performance criterion, 
and effects of independent variations in St, St-R durations were assessed 


using total learning time as the response me: 
refers to Trials X Exposure Time. 

durations were 4, 1, 2, and 4 sec. 
directly related to total learning time, 
learning rate when Ss were equated in terms of performance. 
versely, St-R durations were found invariant with 


sure. Total learning time 
The values of both St and St-R 
Stimulus durations were found 
but had a nonsignificant effect on 
Con- 


respect to total 


learning time, but directly related to learning rate. 


In an earlier paper, Nodine (1963) found 
an cightfold increase in exposure duration of 
stimulus members alone (St duration) and 
stimulus and response members paired (St-R 
duration) from 3:3 to 4:4 sec. produced 
slightly less than an eightfold increase in 
numbers of correct responses. Total learning 
time for .Ss in the former exposure-duration 
combination at the end of 21 trials, however, 
was 5.5 times greater than that for Ss in the 
latter combination. Unfortunately, data 
limitations precluded further extrapolations 
of the data, but suggested that some inter- 
mediate St, St-R combination might provide 
an optimum relationship between learning 
performance and total learning time. 

The present experiment extends Nodine’s 
previous findings by assessing the effects of 
independent variations in exposure duration 
of stimulus and response members on Total 
Learning Time (TLT) defined as Trials to 
Criterion X St, St-R Exposure Time. Of 
interest also are the effects of these variables 
on stimulus term (R-S) recall. 

Method.—Vhe conditions, apparatus, ma- 
terials, and procedure of the present. eda 
ment differed from those of a int n 

details: use of 5 rather than 10 Ss is cach 
of the 16 St, St-R combinations; Ss taken toa 

1 This research Md Uo da ee (d 
Service Research Gran ail The author wishes to 
National Institutes o to Fred Farzanegan for 


express his appreciation 
collecting the data. 


15/16 learning criterion rather than a fixed 
number of trials; and the addition of an R-S 
recall trial using a 4:4-sec, St, St-R combina- 
tion immediately following the criterion trial 
during which only response members were 
presented and Ss were required to supply the 
stimulus member for cach pair. The values 
of St, St-R durations remained the 
3, 1, 2, and 4 sec. 

Results and discussion.—Analysis of vari- 
ance of correct responses for the first 20 trials 
in the present experiment (equivalent to 
Nodine's, 1963, study) revealed a pattern of 
significant effects similar to those of Nodine's 
earlier experiment. The effect of St-R 
durations was highly significant, F (3, 64) 
= 47.87, p « .01. Holding St durations 
constant, a pattern of increasing numbers of 
Correct responses was obtained as St-R 
durations increased from 3 to 4 sec. This 
pattern was also present as St durations in- 
creased, but less markedly. The effect of St 
durations, as a consequence, fell short of 
statistical Significance, F (3, 64) = 1.01. 
Correct responses increased significantly from 
Trial Block 1 to Trial Block 4, F (3, 192) 
= 34045, p < 01. Trials also interacted 
with exposure durations reflecting increasing 
increments in numbers of correct responses 
from Trial Block 1 to Trial Block 4 as both 
St and St-R durations increased. The slope 
of the curve over trials for St-R durations was 
steeper, however, than that for St durations. 


same; 
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Response latencies were found to be significantly over trials in a negatively 
directly related to St-R durations. An anal- accelerated fashion and interacted with both 


ysis of variance 


of reciprocals of latencies St and St-R durations. 


revealed a significant effect for St-R durations, Figure 1 shows the relationship between 


F (3,64) = 30.70, p «.01 but not for St T 


T and each of the 16 St, St-R combina- 


durations. Latency scores also decreased tions. An analysis of variance was performed 
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TLT. The F for St durations was highly 
wentficant, F (3, 64) = 15.10, p< .01. Nei- 
ther St-R nor the interaction was significant. 
The highly significant F for St durations sug- 
gests this factor was the primary contributor 

LT. 
"n comparison of means for each St, St-R 
combination using the Tukey procedure 
showed significant differences between }- and 
4-sec. St durations, F (4, 64) = 8.29, p < .01, 
and 1- and 4-sec. St durations, F (4, 64) 
= 7.89, p < .01, indicating that }- and 1-sec. 
St durations made considerably less con- 
tribution to TLT than did either 2- or 4-sec. 
St durations in combination with St-R dura- 
tions. A trend analysis of the St duration 
main effect revealed a highly significant lincar 
component, F (1, 64) = 41.51, p < .01, sug- 
gesting that an increasing linear function best 
described the relationship between St dura- 
tions and TLT. 

In general, it appears that the 3- and 1-sec. 

St durations combined with 4- and 1-sec. 
St-R durations produced the most efficient St, 
St-R combinations with respect to TLT. 
From Fig. 1, it is evident that the 1-sec. St-R 
curve was least affected by St durations, 
When performance was equated for all com- 
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binations in the present study, the i:jeec. 
combination had the shortest TLT. In con- 
trast, when trials rather than performance 
was equated, Nodine previously found the 
3:4-sec. combination ranked last in number 
of correct responses while the 4:4-sec. com- 
bination ranked first. Clearly, from the 
present findings, the 2:2.sec. combination 
which is conventionally employed in PA tasks 
is not as efficient as either the j-sec. or I-sec. 
St, St-R combinations when performance is 
considered in relation to TLT. It is also 
evident from these data that St durations arc 
not invariant with respect to TLT. For St-R 
durations, TLT appears to follow no consist- 
ent trend. From these data, it appears that 
St-R durations are important determiners of 
learning rate while St durations beyond 1 sec., 
inflate TLT without accompanying increases 
in performance. Finally, the analysis of the 
R-S recalls showed no significant effects sug- 
gesting that neither St nor St-R durations 
influenced the recall of stimulus terms. 
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PRIMARY STIMULUS GENERALIZATION 1 


ALBERT F, HEALEY? 


University of Massachusetts 


Rats were trained to run to food in a straight runw 
deprivation, with a 400-cps tone sounding and a 79- 
- Generalization test trials 
levels of deprivation (23 hr. and 12 hr. 
(400 cps and 200 cps, and 2 levels of th 


in front of the food cup. 


32 sq. cm.). 


than varying either singly. The latte: 


lowered generalization gradients, 
fashion. 


ti lization was 
‘compound stimulus genera y 
"E Wen in the straight runway using 
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chuserts; Institutes of Health Predoctoral fellowship. 
National Ins, indebted to Seymour Epstein for his 
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Log Now at Princeton University. 


Variation along the 2 stimulus dim 


ay under 23 hr. food 
sq. cm. white square 
Were carried out at 2 
), 2 levels of the tone dimension 
e size dimension (79 Sq. cm. and 


ensions produced more decrement 
r, compound, effect was appar- 


Decreasing deprivation time reliably 
again 


in an apparently additive 


visual size and fr 


equency of a pure tone as two 
dimensions of 


the CS. Fink and Patton 
(1953), with rats, and White (1958), with 
children, found an increasing generalization 
decrement in operant responding as more CS 
dimensions were altered, thus supporting 
Hull's (1952) theoretical expectation. Butter 
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(1963), with pigeons, demonstrated a multi- 
plicative compound effect. 

Also studied here was the effect of varying 
drive strength (hours of food deprivation) on 
generalization along both stimulus dimensions. 
Brown (1942) and Newman (1955), both with 
rats in a straight runway, reported changes in 
the heights of gradients with alterations in 
drive level while Hull's (1952) H X D formu- 
lation requires both height and slope change 
with drive level. Both predicted outcomes 
were obtained by Porter (1962) in the human 
cyelid-conditioning situation. 

Method.—Vhe apparatus was à straight 
runway, 24 in. long, 4 in. wide, and 6 in. deep, 
With a separate 12-in. start box. At the goal 
end was a metal panel with a 1}-in. square 
door in its center allowing access toa food cup. 
The door, and part of the area around it were 
Painted flat white to form a 32-sq. cm. white 
square; a second panel could be slid in front 
of the first to form a 79-sq. cm. white square. 
Alongside the runway was a mounted 4-in. 
loudspeaker which emitted a pure tone of 
either 200 cps or 400 cps at approximately 
85 db. 


One second after initiation of the tone, the 
Start door opened and a timer started. This 
timer stopped and a second started when S 
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interrupted a beam 1 in. from the start door. 
The second timer and the tone ceased when S 
pushed in the door on the stimulus panel. 

The Ss were 48 white albino rats, 100-110 
days old at the outset. They were fed for 
1 hr. each day with water always available. 
After adjustment to the apparatus, Ss were 
trained to make the door-pushing response to 
obtain a small, approximately constant, 
amount of wet mash. For acquisition trials, 
the tone was set at 400 cps, the 79-sq. cm. 
white square was in place and Ss were 23 hr. 
deprived. Two trials a day were given for 7 
days and 4 a day for the next 10 days to total 
54, at which point performance had stabilized. 
To test for generalization, four matched 
groups of 12 Ss cach were selected. Two 
groups were tested under the training de- 
privation level (23 hr.) and two shifted to 
12 hr. deprivation by interpolating a 1-hr. 
feeding session midway between the last train- 
ing day and the first test day. One of the two 
groups at each deprivation level was tested 
with the same tone as in training (400 cps), 
the other with a 200-cps tone. 

With temporal order counterbalanced 
within each of the four groups, all Ss were 
tested with the training square (79 sq. cm.) 
and, on a second test day, with the smaller 


50 mc RR 
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square (32 sq. cm.). Two recovery days, with 
acquisition conditions reinstated, intervened 
between the 2 test days. On each test day Ss 
were first given three training trials as usual, 
then three nonrewarded test trials. 

Results and discussion.—Results are re- 
ported only for running speed (1/latency) 
since starting-speed data yielded essentially 
the same effects. The Fs included are from an 
analysis of variance with two between-S 
variables, deprivation level and tone fre- 
quency, and repeated measurements with the 
size dimension and the three test trials. 
Figure 1 displays mean running speed aver- 
aged over test trials, for four combinations of 
levels of deprivation time and size as a 
function of tone frequency. 

Generalization decrements occurred with 
variation along the size dimension, F (1, 44) 

= 20.89, p < .001, and the tone dimension, 
F (1,44) = 4.55, p < .05. Varying both size 
and tone produced greater response decrement 
than varying either alone, confirming the 
theoretical expectation. The contrast for size 
alone vs. tone plus size, using a formula for 
independent means, yields a ¢ (46) = 2.35, 
b <.05. With a formula for correlated 
means, comparison of tone alone vs. tone plus 
size yields a t (23) = 6.32, p<.01. The 
stimulus dimensions did not interact, a finding 
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at variance with Butter's (1963) results and in 
agreement with the White (1958) study. 

Consistent with studies cited above, de- 
creasing deprivation time reliably decreased 
test-trial performance, F (1, 44) = 9.89, 
b «.005. Also, with Brown (1942) and New- 
man (1955), there was no significant inter- 
action of deprivation level with either the tone 
or the size dimension, although there was an 
observable trend in that direction with the 
size dimension (Fig. 1). 
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ING IN THE 


APPARENT ABSENCE OF MOTIVATION : 


MILTON A. TRAPOLD, SANDRA R. BLEHERT ?, ANp THOMAS STURM 


University of Minnesota 


This experiment is a replication of an earlier one by Robinson (1961) 


in which, subsequent to training rats to make bot 
bar-holding response to terminate the same sh 


extinction of the locomotor avo 


h a locomotor and a 
ock-paired stimulus, 


ocon idance response was shown to produce 
no transferred extinction of the bar- 


holding response. 


Robinson (1961) „has recently reported 
results which are seriously at — ray 
the widely accepted two-process t id ha 
discriminated avoidance learning : (Kim dle, 
1961). Briefly, Robinson’s experiment in- 

1 This research was supported by a grant from the 


versity of Minnesota. 
vaduate Research Fund, Unive! 
ow at Stanford University. 


volved (a) training rats to make two different 
Tesponses to terminate the same shock-paired 
CS, (b) extinguishing one of these responses 
in the experimental group but not in the 
control group, and (c) comparing the two 
groups on their performance over a series of 
extinction sessions on the second response. 
He found that the prior extinction of one 
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experi? ing for the experimental and contro 
presen Mental and control groups from | 

ited here were estimated from Fig. 


oes in the experimental group did not, 
tear be expected from the la ane 
the hy , lead to faster subsequent extinction o 
aoe response. This experiment 15 od 
criti an. to replicate as closely as possible the 
ical groups of Robinson's study. 
y mea Ss were 15 male and 15 
appa, € naive hooded rats 100 days old. Thie 
Cheeses was an automated shuttle box 
in, i rised of two identical compartments S 
or a 8 in. wide, 12 in. high, with grid gurs 
apad ee shock, metal walls, and o 
roni | 1 Y a permanent metal barrier to OE at 
he bi he floor, and by a guillotine door fr: 
; Jürrier to the ceiling. 
‘ Tus procedure involved first Ter 
ne from one compartment to ii — 
Suill E a S-sec. CS (light + buzzer po 
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IE training procedure were identical fs 
sh ee with the exception that pore ned 
in o Was a constant 375 v. ac (.2 mego = 
ko with S$) for all Ss rather than te 
d bu Subtetaniz ng" shock level, individua y 
pu en ted for cach S, employed by Robinson. 
me a with the doorway between compati 
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Were confined to one compartment and 
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STUDY 


12 14 16 18 20 22 24 


ock-pz ired CS off during acquisition 
tudy and of Robinson's (1961) original study. (The 
sre there labeled IA and In, respectively, and the data 


port.) 


The experimental group (N = 17) was 
then subjected to extinction of the hurdle- 
jumping response using Robinson's procedure 
exactly but with a limit of 24 extinction 
sessions if S failed to reach the extinction 
criterion. ‘The control group (N = 9) re- 
mained in their home cages during this phase. 
All Ss were then given 10 bar-holding test 
sessions under conditions identical to bar- 
holding acquisition. 

Results and discussion.— The experimental 
and control groups did not differ in the 
acquisition of cither the hurdle response or the 
bar-holding response. There were, however, 
contrary to Robinson's results, gross differ- 
ences between these two groups on the bar- 
holding tests after extinction of hurdling in 
the experimental group. These differences 
appear in Fig. 1 where the mean percentage 
of time the bar was depressed in a 5-min. 
session is plotted over sessions for Ss of this 
experiment and for the comparable Ss of 
Robinson's study. The slight difference be- 
tween Robinson's groups was not significant 
over the 25 tests, nor was there any evidence 
of extinction in either of his groups. The ex- 
perimental group of this study, on the other 
hand, showed a significant decrease from the 
last 2 days of acquisition to the first 2 days of 
testing, t (16) = 8.04, P <.001; and was 
significantly below the control group over all 
10 test sessions; £ (24) = 6.21, p < .001. 
Moreover, the control group here showed a 
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significant amount of extinction when the 
first three test sessions are compared with the 
last three; £ (8) 2.94, p « .05. . 
'The results of this study are consistent 
with the two-process theory of avoidance 
learning, but are clearly inconsistent with 
Robinson's data. Moreover, no good ex- 
planation for the latter is apparent. The only 
two procedural differences of note were the use 
here of a constant shock level, rather than 
individually determined “just subtetanizing” 
shocks, and the necessity here to use some 
shaping early in the acquisition of the bar- 
pressing response (which Robinson does not 
report having to use). It does not seem likely, 
however, that either of these differences is 
responsible for the differences in results, 

The change in shock was necessitated when 
in some preliminary research, several Ss were 
damaged because of large fluctuations, both 
within and between sessions, in their tetany 
threshold. Thus, when a thorough review of 
the shock-intensity literature revealed no sug- 
gestion of any qualitative differences between 
shock values in the “just subtetanizing” range 
and those slightly less intense, a fixed shock 
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value slightly below the lowest ever observed 
to produce tetany was substituted. 

Similarly, the writers are aware of no 
evidence to suggest that shaping would pro- 
duce a response so much more susceptible to 
transferred extinction effects, and this is 
further borne out by the absence in the present 
data of any systematic differences between Ss 
who were or were not shaped. 

There were, additionally, slight differences 
between this study and Robinson’s in the con- 
struction of the apparatus, the composition 
of the CS, and the strain of rats, but again 
there is no good reason to think that the 
grossly different results were due to any of 
these, and no other explanations are apparent 
to the writers, 
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i PARAMETERS OF PAIRED-ASSOCIATE VERBAL LEARNING: 


| * LENGTH OF LIST, MEANINGFULNESS, RATE 
OF PRESENTATION, AND ABILITY ! 


JOHN B. CARROLL AND MARY LONG BURKE 


Harvard. University 
associates in a factorial design 


), 3 levels of meaningfulness, 
, 2 levels of ability, and 2 equiv- 


144 10th-grade Ss learned verbal paired 
with 4 list lengths (4, 8. 12, and 20 p: 


3 rates of presentation (3, 4, and 8 sec. 3 v 
alent forms. Dependent variable was amount of time to learn each pair 


to 1 correct anticipation, summed over the pairs in a list; logarithms of 
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entation, and ability level in a single 
factorial experiment. At the same 
time, an effort was made to avoid 
complexity by selecting stimulus and 
response terms at the same levels of 
meaningfulness, setting presentation 
(S term only) and confirmation (S 
term plus R term) periods equal, and 
eliminating all intertrial rest periods. 
The problem posed was to attempt to 
formulate a general equation that 
would permit the estimation, as 
accurately as possible, of learning 
time as a function of the four variables 
selected for study. 


METHOD 


Subjects.—The Ss were 144 students ina 
large suburban high school, randomly selected 
from the entire tenth-grade class. Boys and 
girls were about equally represented. In 
study hall periods, Ss were given Part V, 
Paired Associates, of the Modern Language 
Aptitude Test (MLAT) (Carroll & Sapon, 
1958) and divided into two groups—fast and 
slow learners—at the group raw score median 
(10.5). Kjeldergaard (1962a) had demon- 
strated that use of this test could increase the 
efficiency of experiments involving paired- 
associate learning. 

Materials.—The practice list (see below) 
consisted of 10 pairs of trigrams with asso- 
ciation values of 10-77% (Archer, 1960): 
WEY-LAV; MU! DAF-PIR; VOT-YAW; SIV- 
HUS; FAS-TEK; CEN-BIP; PAG-DUX; RUD-VIK; 
NAC-FEN. 

The lists of paired associates used in the 
main experiment were at three levels of mean- 
ingfulness: High, Medium, and Low. Pairs 
for the high-meaningfulness lists were pairs 
that had been missed by fewer than 50% of 
a group of high-school and college students 
in a “one-trial learning" experiment con- 
ducted by Kjeldergaard (1962b); the words 
in these pairs were nouns, verbs, or adjectives 
of varying length with frequency ratings of 
AA or A in Thorndike and Lorge (1944). 
Medium-meaningfulness lists used low-fre- 
quency five-letter words that Occurred at 
least .5 times, but less than 2 times per 
million on the Lorge magazine and semantic 
counts and on the Thorndike general count, 
Low-meaningfulness materials were trigrams 
with low (0-35%) association values (Archer, 


1960). 
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In the construction of lists, stimulus and 
response words were paired randomly under 
the restriction that members of a pair should 
not begin with the same letter or terminate 
with the same combination of letters, Fur- 
thermore, in any given list, it was true of 
both the stimulus terms and the response 
terms that no two terms began with the same 
letter or ended in the same combination of 
letters (with one or two exceptions, e.g. 
EIDER and UMBER). " 

Within each level of meaningfulness, lists 
of 4, 8, 12, and 20 pairs were used. Twelve- 
pair lists were constructed by random selec- 
tion of pairs from the 20-pair lists, 8-pair lists 
from the 12-pair lists, and 4-pair lists from the 
8-pair lists. There were two forms of cach 
list, containing entirely different pairs (Forms 
A and B) to minimize the possibility of list- 
specific results. The lists are shown in 
Table 1. 

For use in the Lafayette memory drum 
(Model 302B), each list was prepared in the 
form of three different random orders printed 
in capital letters (from an IBM 402 tabulator) 
on à continuous loop of paper. Presentation 
of each random order was considered as 
constituting one trial; trials succeeded one 
another without any break. 

Procedure.—The Ss were brought to the 
experimental room individually and ane 
structed in the use of the memory drum. ‘The 
anticipation method was used throughout the 
experiment; S was to try to say aloud (pro- 
nounce) the correct response before it was 
exposed by the shutter in the right-hand 
section of the window in the memory drum. 
The Ss were allowed to use any reasonable 
Pronunciations of the response terms, For 
warm-up, each .$ was given 15 trials on the 
practice list at a 4-sec, presentation rate 
(2 sec. for presentation of the stimulus term 
alone, and 2 sec. for presentation of the re- 
Sponse term along with the stimulus term). 
Each S was then required to learn a particular 
experimental list at a particular rate. of 
presentation—3, 4, or 8 sec. per pair, the time 
being equally divided between the stimulus 
Presentation and the stimulus-response pres- 
entation. The criterion of learning was one 
Perfect trial, but the experiment was ter- 
minated after 40 trials if this criterion had not 
been achieved. ‘The correctness of each 
anticipation was noted, a failure to anticipate 

eing recorded as incorrect. 

Design—The materials and procedures 
described above implied a 4 43x 3x2 x2 
factorial design ; each of the 144 Ss was tested 
under a unique condition. Preliminary 
analyses of the results indicated that there 
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TABLE 1 


EXPERIMENTAL LISTS 


No. of High Meaningfulness Medium Meaningfulness Low Meaningfulness 
Pairs ES 
RN NN Form A Form B Form A Form B Form A | Form B 
PAIN COLD — DELFT MERLE WHELK ILIUM FYM RUV 
FAGOT WOOER | FACET LEACH ZER GEX 
EGRET KETCH BASIL OVATE YIT FIK 
| UNCUT ANODE | SWALE FIBRE | NAFZOD | NAX ZAB 
VAIN GLAD SOUSE LLANO LIMBO VISOR TAH YEM 
ESCAPE WIN PHIAL NADIR INGOT CAPON HAJ KIV 
| CAPITAL GUARD YUCCA TRYST TU! MANSE BOJ JUB 
PIECE OFFER TAUPE REFIT NONCE ETUDE VOP LEQ 
GRUEL EIDER PAYER ALACK | GAC TUZ | KIG SAJ 
LEAPT UMBER d RUH VAW | RYX VUS 
WINCH HOURI EERIE NATAL | SIJ DYB | GOX TAZ 
SECOND GRAIN AXIOM OATEN VENAL BLEAR | KEBSEJ | SIW DEJ 
MOUTH HAIR REACH JOIN | s GROAT SULLY | LAJ MUP | PYB BOF 
CREATE STRA DETAIL EXPECT | MIDGE DAVIT K JOIST JOZ NAQ | JID NUK 
20 HOLD JABOT IDIOM ODIUM KRAAL | QAZ COG | DYF PIB 
DUCAT RENAL | PEF BAV | QOT CEG 
ARRAS TRI CUG WIB | WYD HIJ 
FASHION GENTLE | BIGHT PRAWN KRONE HORNY | MIV QEN | CEF WUG 
| DREAM HIGH KULAK FLUME | MILCH DOUSE | DUJ POB | LUJ MEB 
| LAW DESTROY NITRE JUNTO JUNTA PRONG | WOX HYF | MYP QUM 


gth L=4, 8 


Were 


eai significant differences (p> .05) be- 
tinum he two forms of any list with respect 
as the gjer of trials to criterion (40 being used 
Derfe Score for those Ss not attaining à 

ct trial), and thus list form was em- 


Ploye, ahi 3 x 
ar Yed as a within-cell variable in subsequent 
talyses, 


RESULTS 
an ember of trials to criterion—In 
as out of the 144 Ss failed to 
Wider criterion in 40 trials. It is 
."ICent from Table 2 that low mean- 


in 
c Sfulness and list length were the 
lef factors associated with failure to 


, 8, 12, or 20, is presented here as the first L pairs ina column. The response 
always the second member of the pair. 


achieve criterion. A four-way anal- 
ysis of variance using as the dependent 
variable the number of trials per- 
formed by each S (whether or not he 
attained criterion) showed all main 
effects significant (P < .01), with no 
significant interactions. Because of 
the ambiguity of this dependent vari- 


able, however, these results deserve 


little attention. 
Time scores with missing data. sup- 


plied by extra polation —Time to learn 
a specified number of pairs was con- 


TABLE 2 


rAGE OF SS FAILING TO 


REACH CRITERION IN 40 TRIALS, 


PERCI 
E py LEVELS OF MAIN EFFECTS 
Mean i m" 
fulness % List Length* % Rates (ee) % ability % 
wih z3 " 2.8 8 18.7 Fast 20.8 
È edium 14.6 8 16.6 4 14.6 Slow 23.6 
oY 457 12 27.8 3 33.3 
= U 20 41.6 
ay = ae eum level. 
bi eac] 
aN = 48 each cae 


72 each level. 
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sidered a more meaningful and useful 
measure of learning than number of 
trials to criterion. In the case of Ss 
who failed to attain the overall cri- 
terion in 40 trials, it was necessary to 
estimate by extrapolation the number 
of trials that they would have required 
to attain a given number of pairs (i.e., 
to have correctly anticipated each of 
that number of pairs at least once).* 
It was observed that for any given S 
there was an approximately linear 
relation between the number of items 
learned (i.e., correctly anticipated at 
least once) by a given trial and the 
logarithm of the number of that trial: 
if this relation were perfect it would 
be described by the equation 


log t = q + bn, [1] 
where / — trial number, 2, = the 
number of items anticipated correctly 
at least once at any time from Trial 2 
through Trial ¢, and g and b are 
parameters pertaining to the in- 
dividual S and the particular condi- 
tions of the experiment. For each .S 
who had not correctly anticipated 
every item at least once by the time 
he terminated the experiment, a 
least-squares fit (minimizing the sum 
of the squares of the differences be- 
tween predicted and actual trial 
numbers expressed logarithmically) to 
the above equation was made for all 
available data points (£, »;) and the 
missingdata points supplied byextrap- 
olation. Extrapolation was found to 
be necessary in only 21 cases, all 
having list lengths of 8, 12, or 20: in 
only 8 cases were more than three 
data points missing. Over the 144 


? [t should be noted that this is a slightly 
different criterion from that of a perfect trial 
for that number of pairs. Several Ss who did 
not meet the criterion had even before. the 
fortieth trial correctly anticipated every item 
at least once during the course of the ex- 


periment. 
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cases, the median r between 2; and 
log £ was .95. 

From the data points (f, 5)— 
whether actual or extrapolated— 
there was further computed the total 
amount of time, T, required by each S 
to learn the first 4 pairs he learned, 
and where appropriate, the first 8, 12, 
and 20 pairs learned. (For example, 
in the case of Ss who learned 12-pair 
lists, the amounts of time to learn 
were computed for the first 4 and the 
first 8 pairs they happened to learn 
as well as the amount of time to learn 
all 12 pairs in the list.) This computa- 
tion, patterned after a procedure used 
by Bugelski (1962), totals the amounts 
of time during which each of a given 
set of pairs has been presented up to 
and including the trial on which the 
pair is correctly anticipated for the 
first time. For example, an individual 
who required 5, 9, 10, and 16 trials, 
respectively, to make correct re- 
sponses for the first four items he 
acquired in an 8-pair list presented at 
the rate of one pair every 3 sec. 
would be reckoned as having required 
3(5 +9 + 10 + 16) = 120 sec. for 
this learning. Logarithms (to the 
base e) of these time scores were taken 
in order to secure arrays with more 
symmetrical distributions than would 
have been the case with the original 
time scores in seconds. The resulting 
logarithmic scores were used for 2 
series of analyses of variance, to be 
described below. 

Of primary interest are the means 
displayed in Table 3, for these report 
how much time is required for learning 
(to a criterion of one correct anticipa- 
tion each) a given number of pairs in à 
given experimental setting. Each 
value entered in the table is a mean 
for four cases, the data for the two 
list forms, A and B, and for the two 
learning ability levels being pooled. 
It is evident that the amount of time 
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TABLE 3 


MEAN Loc, Times (SEC) TAREN to ACQUIRE TO A CRITERION OF ONE CORRECT 


RAL LENGTHS AND I 


NTED AT THREE RATES 


Meaningfulness 
ERA : : 
High Medium Low 
List Length 
Rate of Presentation (Sec.) 
3 4 8 3 4 8 3 4 8 
First 4 Pairs 
4 sar | xci | 449 | 478 | 490 | 496 | 491 | 465 | 5.22 
8 yar | sss | 416 | £67 | 3.75 | 473 | 468 | 481 | 515 
12 231 | 3537 | 416 | 443 | 428 | 474 | 454 | 454 | 528 
20 $36 | 372 | 421 | 419 | 412 | 486 | 464 | 412 | 4.85 
First 8 Pairs 
it 8 Pairs 450 | 460 | sos | 5.99 | $03 | 5.75 | 612 | 5.93 | 613 
12 139 | aaa | 50r | $43 | 528 | S81 | $37 | 382 | 030 
20 449 | 4.5 $09 | 512 | 5.06 | 5.80 | 5.66 | 5.38 | 5.95 
First 12 Pairs 
L ise FT RR 611 | &11 | 647 | 655 | 644 | 7415 
20 iss | 5.15 375 | 562 | 641 | 617 | 5.70 | 6.68 
20 Pai 
20 cop am | ist | om | au | run | 747 | 67a | 7a 


Note.— Each entry is the mean of four values 


of ability), 


required to learn a certain number of 
Pairs is relatively independent of the 
ength of the list in which these pairs 
are being learned. The mean log. 
times required to learn, say, 4 pairs 
Of high-meaningfulness syllables ata 
“Sec, presentation rate lie within the 
range 3.55-3.72 whether the syllables 
are in a list by themselves or imbedded 
in a list of 20 syllables; the mean loge 
imes to learn 8 pairs under similar 
Conditions fall between 4.44 and 4.60; 
Or 12 pairs, between 5.05 and 545; 
Da for 20 pairs, the mean log, time 
th 6.11. This would suggest that 
w fre is a uniformity 1n the rate at 

hich new information can be ac- 
quired that largely transcends the 
Ontext in which the learning takes 
Place, 

Data presented in Table 3 suggest, 


(from two alternate forms of each list taken by Ss at two levels 


however, that the mean loge time to 
learn the first 7 pairs tends to de- 
crease somewhat with the length of 
the list in which these pairs are im- 
bedded. In four-way ANOVAs per- 
formed for each of the first three 
blocks of Table 3 (ability level being 
also used as one of the main effect 
variables), list length was a significant 
effect at either the 1% (for 4 pairs 
and for 12 pairs) or the 5% for 8 
pairs) levels. This is probably due 
to the sort of selection artifact noted 
by Williams (1961): on the as- 
sumption that an S learns items in 
the order of their difficulty for him, 
the first n items that are learned by a 
given S in a list of a given length L 
(L > n) are likely to be easier for him 
than the fixed items found in a list 
of length m, even though the lists are 
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designed to be homogeneous in diff- 
culty for populations of Ss. 

An analysis of variance was also 
performed for all data in Table 3 per- 
taining to acquisition of complete 
lists (i.e., learning of the 4 pairs in a 
4-pair list, of 8 pairs in an 8-pair list, 
etc.). All main effects were significant 
beyond the 1% level; F ratios with 
associated degrees of freedom were as 
follows: Presentation Rate, F (2, 72) 


= 19.1; Meaningfulness, / (2, 72) 
= 105.2; List Length, F (8, 72) 
= 164.9; Ability, F (1,72) = 10.2. 


No interactions approach significance 
except Meaningfulness X Ability 
(p = .10) and Rate x Meaningful- 
ness (p = .16). The main effects 
account for 83.8% of the total vari- 
ance in the data, and to this degree it 
would be satisfactory to predict the 
observations by using the main effect 
means shown in Table 4 in the type of 
linear equation underlying the anal- 
ysis of variance. 

The significant effect obtained for 
rate of presentation requires comment 
in view of Bugelski's (1962) finding 
that within the limits of the conditions 
he used, total time for learning did 
not vary significantly even when rate 
of presentation varied. The present 
experiment was conducted before 
Bugelski's report appeared in the 
literature, and is not exactly com- 
parable to Bugelski's: Bugelski used a 
constant stimulus-term time of 2 sec., 
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exposed the response syllable for 
cither 2, 4, 6, 8, or 15 additional sec., 
and allowed a 2-sec. interval between 
pairs. Thus, Bugelski’s 2-sec. and 
4-sec. response conditions amount, 
respectively, to total times of 6 and 8 
sec. per pair. Further, Bugelski used 
college sophomore Ss (in contrast to 
our tenth graders), set a criterion of 
two successive anticipations of a 
given pair, and gave no warm-up 
session. Nevertheless, our data for 
low-meaningfulness 8-pair lists at 
4-sec. and 8-sec. rates of presentation 
are roughly comparable to Bugelski's 
data for his 8-pair nonsense-syllable 
lists at 6-sec. and 8-sec. exposure rates. 
Converting our. mean log, times of 
5.93 and 6.13 for these conditions 
(from Table 3) to mean seconds per 
syllable, we obtain values of 48.2 and 
52.5 sec., respectively, to be compared 
with Bugelski's mean values of 61.2 
and 70.1 sec., respectively—reported 
by Bugelski as not significantly differ- 
ent. In view of the lack of compara- 
bility of procedures and analyses, the 
two sets of results are not seriously 
discrepant. But it will be noted that 
our range of presentation times (8,4, 
and 8 sec.) has only a small overlap 
with the range of times used by 
Bugelski (6, 8, 12, and 19 sec. by a 
comparable reckoning).  Bugelski's 
finding that total learning time is 
roughly constant applies, it would 
appear, only to relatively slow pres- 


TABLE 4 


MAIN EFFECT MEANS ron Inracr Lists (Loc, 


SEc.) 


Mate (eee). | w» E773 M 
3 os H 4.97 
4 5.48 M 5:0 
8 6.03 L 6.26 


List Length M 


Ability M 
4 4.52 Fast 5.59 
8 5.46 Slow 5.83 
12 6.07 
20 6.78 


Grand M: 5.71 
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entation rates. lt is also. probable 
that it applies more particularly to 
difficult, low-meaningfulness materials 
in which there are likely to be a large 
number of trials with the faster 
presentation rates. In our data, it is 
clear that high-meaningfulness ma- 
terials are more efficiently. presented 
With fast rates; a rate of 8 sec. tends 
to hold the learner back. For low- 
meaningfulness materials, on the other 
hand, there is some evidence of a 
curvilinear trend such that a 4-sec. 
rate is superior to either a 3-sec. rate 
Or an 8-sec. rate. It was noted 
above that the Rate X Meaningful- 
ness interaction approached signifi- 
cance (p = .16). 

The demonstration that ability 
level was a significant main effect 
(b < .01) means merely that there is 
a significant correlation between an 
external measure of paired-associate 
learning ability (Part V of the MLAT) 
and ability manifested in the experi- 
Mental task itself. This does not 
*Xclude the possibility that the ex- 
Pirimental error ("within cells") in- 

udes ability variance not predicted 
"ar the external measure. 
ton of data to Thurstone's rational 
Of isi. Jor learning time as a function 
tio, st length —Thurstone (1930, Equa- 

n 40) proposed that the relation 
tween the number of items in a list 
and the total learning time is given by 


[2] 


——— 


; c 
T= -nyn—-4@ 
Q" 


where T = total learning time meas- 
red to any arbitrarily stipulated 
estee of perfection, # = number of 
me in the task, c = a constant de- 
ending on the complexity of the task 
the degree of perfection required, 
ada, S's learning ability, and @ = “an 
m Ptation constant of the subject esa 
ually | | his attention span. As- 
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suming a = 0, taking logarithms, and 
introducing a constant s for the ex- 
ponent of n (s being equal to 1.5 for 
lhurstone's equation with a = 0), we 
have the linear equation 


log T = s log n + loge — log k [3] 


The logarithmically plotted graph 
(Fig. 1) of the whole-list data from 
Table 3 suggests that these data could 
reasonably be fitted by Equation 3 
using appropriate parameters for c 
and A. To test this observation, the 
total variance of the dependent vari- 
able expressed logarithmically was 
broken down into portions attribu- 
table to (a) meaningfulness, rate of 
presentation, and their interactions; 
(b) the regression on list length 
expressed logarithmically, separated 
into components attributable to a 
common slope for all conditions and to 
deviations from the common slope 
attributable to particular levels of 
the aforesaid main effects and their 
interactions, and (c) departures of 
array means from a common linear 
slope and from slopes for particular 
levels of the main effects and their 
interactions? Ability level was not 
studied as an independent variable in 
this analysis; thus, the experimental 
error term includes variance that 
could be in part ascribed to ability 
level. As in the previous analysis, 
Meaningfulness and Rate of Presenta- 
tion were highly significant sources of 
variance (the mean squares remaining 
the same and the size of the error 
term being only slightly different). 
For the Common Slope of log learning 
time as a linear function of log list 
length, F (1, 108) — 460.9, obviously 
highly significant. No other signifi- 
cant F ratios were found, from which 


5 The computational procedure is a special 
case of a highly generalized regression analysis 
procedure developed by Albert E. Beaton, Jr. 
(Beaton, 1964). 
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e" 
—Á i " 
a" | Rate of Presentation —<—— 
mi Pd DA 


Loge Learning Time (Sec.) 


Fic. 1. 
medium, low) and rate of presentation, 
ability levels, two list forms.) 


it may be concluded that (a) the nine 
curves shown in Fig. 1 do not depart 
significantly from linearity and (b) 
the slopes of the lines fitting these 
curves in a least-square sense do not 
depart significantly from a common 
value. Over the whole set of data, 


TABLE 5 
CONSTANTS FOR EQUATION 4 


2.0 
Log, List Length (L) 


Log learning times as a function of log list length, 


R 
Meaningfulness Log cm (Se) Log cr 
High gi : -49 
Medium .19 P m 
Low 32» 32 
eS li cum 


2.5 3.0 


by meaningfulness (high, 


(Each point plotted is a mean for four cases—tW° 


this regression slope is 1.4130, with 
95% confidence limits 1.2876 to 1.5384 
that obviously include Thurstone's 
rationally derived value of 1.5. The 
two main effects and the common 
linear slope account for 84.1% of the 
variance. It is thus possible to 
specify Equation 3 as 
(log T)est = 1.4130 log n 

+ log c, + log c, + 2.5484 [4] 
where the values of log c; and of log 6r 
are chosen depending upon the levels 
of meaningfulness and of rate of 


presentation, and 2.5484 is the inter- 
cept for the regression line averaged 
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over all data. Log c, and log c. are 
actually the deviations of meaningful- 
ness and rate array means (logs) from 
the grand mean (log), and are as 
shown in Table 5. 

Role of ability —Since ability was a 
significant main effect in Table 4, it 
is possible to supplement Equation 4 
on the model of Equation 3 by adding 
a term for ability level, (— log ka). 
In this case, log & for fast learners 
would be equal to 12, and for slow 
learners it would, of course, be —-12. 
But an even better prediction is ob- 
tained by using the actual scores on 
Part V of the MLAT and on the 
pretest, The correlations of these two 


—— Fast Learners 
ia mier Slow Learners 


Learning Time (Sec.) 


Loge 


pate ing times as à 
2. Log learning times * 
em ability (fast, slow). (E 


Tes Y 
Sentation, two list forms. 


2.0 
Loge List Length(L) 


function of log 
Each point plottec 


un 
ne 


measures with log learning times were, 
respectively, —.13 and —.10, and 
their intercorrelation was .24. It was 
found that an optimum composite of 
the two ability measures was a signifi- 
cant component of the variance of log 
learning times, F (2, 136) = 26.9, 
p «0005. The introduction of con- 
tinuous variables of ability into the 
prediction. equation resulted in an 
overall slope of 1.4661, closer to 
"T hurstone's rationally predicted value 
of 1.5. 

It will be recalled, however, that in 
an analysis of variance cited earlier, 
Ability and Meaningfulness had a 
nearly significant interaction. 1f we 


2.5 


ness (high, medium, 


is h, by meaningful: 
EEE es s—three rates of 


lis a mean for six case 


552 


examine Fig. 2, which presents the 
data of Fig. 1 recombined for rates of 
presentation but broken down by 
Ability groups, we observe that Abil- 
ity has a particularly noticeable effect 
for lists of Medium meaningfulness. 
A possible interpretation is that our 
paired-associate measure of learning 
ability (Part V of the MLAT) may be 
to some extent correlated with vocab- 
ulary knowledge, and if so, the low- 
frequency words in the medium-mean- 
ingfulness lists (see Table 1) would be 
more likely known to the fast learners 
than to the slow learners; for the lat- 
ter, these low-frequency words would 
be virtually equivalent to nonsense 
syllables and thus rather similar to 
low-meaningfulness materials. 


Discussion 


The impetus for this study was the 
belief that information concerning ac- 
quisition rates for paired-associate ma- 
terials under various conditions would be 
of use in planning efficiently programed 
instructional materials. 

One of the obvious conclusions to be 
drawn from the experiment is that de- 
pending upon the nature of the materials, 
the conditions of training, and the ability 
of S there are certain limits to the rate 
at which new responses can be acquired. 
The experiment confirms "Thurstone's 
(1930) prediction that as the number of 
items to be learned increases, there is an 
increase in the amount of time that must 
be spent per item. For if we divide 
Equation 2 through by n in order to get 
t, the learning time per item, we find that 
t increases approximately as the square 
root of the number of items. The only 
hope of getting people to learn more 
efficiently is to vary the conditions of 
training (including, presumably, varia- 
tions in incentive, which were not ex- 
plicitly studied here). The immediate 
suggestion coming from the experiment 
is that the optimal rate of presentation 
is b yaired-associate experiment of this 
ixi * approximately 4 sec. as con- 
trasted to 3 or 8 sec. 
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A major insight that comes from this 
study and that was somewhat unantici- 
pated was that items appear to be ac- 
quired one by one and at a rate that is 
relatively independent of the total num- 
ber of items presented in a given trial 
or session. For example, it has been 
Shown in this study that it takes about 
the same amount of exposure and re- 
sponse time for an S to acquire 4 items 
regardless of whether they stand alone 
or are imbedded in a list of 20 items. It 
is possible that Thurstone's square-root 
law is limited to situations in which the 
whole of a list is presented in cach trial 
and that it would not apply in the same 
way to some other organization of the 
learning task, e.g., a progressive part 
method. This problem has not been 
adequately investigated for paired-asso- 
ciate learning. 

It would be desirable to obtain data 
for lists much longer than 20 pairs in 
order to verify the postulated equation. 
Thurstone (1930) has already discussed 
the application of his equation to several 
sets of data involving long lists of 
nonsense syllables or digits to be learned 
serially. There is some credibility im 
extrapolating the present data, however, 
when we try to predict Thorndike's 
(1908) finding that on the average, à 
group of students learned the English 
meanings of 1,030 German words in 30 
hr. Assuming that learning of English 
meanings of German words is a task 
equivalent in difficulty to our medium- 
meaningfulness level, and that Thorn- 
dike's students tended to select the most 
efficient presentation rate for this level of 
difficulty (4 sec. per pair), we may apply 
Equation 4 and estimate that on this 
basis 1,030 pairs would require about 
222,100 sec. or about 61.6 hr., a value 
Which is in the same decimal order of 
magnitude as Thorndike's finding. (Ac- 
tually, Thorndike’s students spent up to 
an additional 8 hr. testing themselves, 


End undoubtedly some learning occurred 
here.) 
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ROLE OF APPARENT SLANT IN SHAPE JUDGMENTS! 


WILMA A. WINNICK asp ILANA ROGOFF 


Queens College of the City University of New York 


2 parallel experiments are reported bearing on the shape-slant invariance 
hypothesis. Apparent-objective slant scales were 1st determined for 4 


rectangles, 2 trapezoids, a random shape, and an ellipse. 


Apparent 


slant was found to be less than objective slant at 10°, 20°, and 30°, and 
to be greater at 60° and 80°; accuracy was achieved at close to 40°. 
Shape judgments were then measured for the same forms set at the 
same angles. The obtained increases in the quantity a — p as a func- 


tion of slant agreed with predictions from the slant scales. 
dicted was the finding of overconstancy at 10° and 20°, 


The experiments to be reported 
comprise an attempt to draw a re- 
search parallel between size constancy: 
and shape constancy, a parallel which 
has been pursued on the theoretical 
plane by Koffka's (1935) treatment of 
the constancies. Apparently, size 
constancy has challenged Es much 
more than has shape constancy, and a 
wealth of experimentation on its 
parameters has been forthcoming 
(Woodworth & Schlosberg, 1954). 
Shape, on the other hand, has been 
shown in a recent review (Epstein & 
Park, 1963) to be lacking in“. . . well 
established functional relationships." 

With size constancy apparently de- 
pendent upon perception of distance, 
shape constancy would seem to be 
dependent upon perception of the 
angle of orientation of the viewed 
object. Beck and Gibson (1955) have 
formalized this relationship as ‘the 
shape-slant invariance hypothesis" : 
“A retinal projection of a given form 
determines a unique relation of ap- 
parent shape to apparent slant Lp. 
126]. Although a few Es have 


1 The slant experiments are based upon a 
thesis submitted by the second author to the 
Graduate Division of Queens College of the 
City University of New York in partial fulfill- 
ment of the requirements for the degree of 
Master of Arts. This study was partially sup- 

orted by a grant from the Graduate Division 
m the City University of New York. 


Not pre- 


studied judgments of slant in shape- 
constancy experiments (e.g., Stav- 
rianos, 1945), and others have studied 
cues to slant judgments (c.g., Smith, 
1956), no experiment to the writers 
knowledge has attempted to measure 
à scale of slant judgments paralleling 
Gilinsky's (1951) determination of a 
scale of distance judgments and fur- 
ther to relate such a scale to shape 
constancy as a function of slant. The 
value of this scale is indicated by the 
statement that “the adequacy of the 
(invariance) hypothesis depends on 
the possibility that the various factors 
whose influence on shape constancy 
has been demonstrated may be shown 
to affect perceived slant [Epstein & 
Park, 1963, p. 2833." 

The first experiment reported, ac- 
cordingly, undertook the determina- 
tion of the nature of the relationship 
between apparent and objective slant 
as a function of conditions of observa- 
tion and characteristics of the form 
whose slant was to be judged. These 
determinations were followed, in the 
second experiment reported, by meas- 
urements of shape judgments for the 
same stimuli at the same angles of 
slant. 


EXPERIMENT I 
Method 


Experimental design.—F our parallel studies 
Were carried out; the first two measured ap- 
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Darent-objective slant scales. for four rec- 
tangles at 207, 407, 60°, and 807, the firstunder 
restricted conditions, the second with full 
view; the third study measured the scale for 
two trapezoids, an ellipse, and a random shape 
at the same angles; the fourth measured the 
scale for one rectangle set at 10°, 20°, 30°, 
and 40°, In each of these independent 
studies, all Ss judged each form used at each 
of the angles studied. 
. Apparatus.—The 
judge, using binocular vision, the 
inclination of a single black figure set 2ft: 
away in the center of a gray experimental 
chamber of 16.25 in. in width, 21 in. in height, 
and 2 ft, in depth. In the first study, the 
figures were viewed through a l-in. opening 
which permitted a view of the middle part of 
the rectangles only. This was judged to be 
similar to Lichtes (1952) experimental 
situation, in which judgments of shape were 
ased on the width of the rectangles only. 
In the other studies removal of the top of the 
chamber provided Ss with a view of the tops 
of the stimulus figures, as well as other parts 
of the chamber. 

Subjects.—The 64 volunteer 95, 16 in each 
experiment, were university undergraduates 
ranging in age from 19 to 27 yr. None had 
Serious visual defects, and 30 wore corrective 
enses at all times (and for this experiment). 

Procedure.—The method used in each of 
the four studies was a modification of the 
Method of average error, in which judgments 
Were made about a single stimulus. On each 
trial, the task of S was to indicate when, as E 
Varied the slant of the figure by rotating it to 
>S right, it appeared to be inclined at the 
angle called for (20°, 40°, 60°, and 80° in foe 
ie three studies and 10°, 20° 30°, a ig 
k the fourth). To insure uniform knowled ge 
a angles, each S was given à linear repre- 

Ntation of the angles used. 
€ session began with an expl 


Ss were required to 
angle of 


anation of 


a task and apparatus and two trials to um 
S understanding of the instructions. 


ca ] à 
ach experiment each of the four forms was 


bonented with four angles for eight pes 
tions ascending and descending deters » 
figu 5. Blocks of trials were amne " 
Abie d varied as follows ABC P 
Were at DCBA. The within-bloc 
sce arranged randomly for angles 
nding and descending trials- x: in 
manipulati rms We! 
randomly either a ES. E frontal-paralle 
E (ascending) or at right angles Geak 
e ‘he form was slowly turned m a 
ing C" cal plane until S reported man i s 
Med on his right side at the required ang 


ks trials 
for both 
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from the frontal-parallel plane. The E re- 
corded the exact setting at which S judged the 
inclination to be at the angle called for (from 
a protractor located at the base of the dowel 
supporting the figure). 

Stimuli.—The forms used were, for the 
first two studies, rectangles all 5 in. in height 
but varying in width, as follows: (A) 3.25, 
(B) 3.75, (C) 425, and (D) 4.75 in. These 
rectangles correspond to those used by Lichte 
(1952) in his study of shape constancy. 

For the third study, the forms were two 
trapezoids, one ellipse, and one random shape. 
The two trapezoids had identical dimensions 
(5 in. in width and 10 and 7.5 in. in height), 
one with the obtuse angle on S's left (C) and 
the other on .S's right (D). ‘The trapezoids 
were such that their visual (geometric) pro- 
jection was equal to that of a rectangle viewed 
at a tilt of 26° (tilt to the left for C and to the 
right for D). In the ellipse, the height of the 
vertical diameter was 11 in. and of the 
horizontal diameter, 5 in. The random shape 
used was Shape B from Borresen and Lichte 
(1962). 

Since no effect of rectangle width had been 
found in the first two studies, the fourth 
used only the rectangle of 4.75 in. width. 


Results and Discussion 


Table 1 summarizes the mean 
settings obtained (objective slant) for 
the angles called for (apparent slant), 
and Table 2 summarizes the results 
of the analysis of variance for the four 
studies. 

TABLE 1 
MEAN SETTINGS IN DEGREES TO MATCH 


ANDARD ANGLES IN THE Four 
Sruptes or Exr. T 


Rectangles 20° | 40° | 60° | 80° 

Study I, Restricted View 259 396 | 55:7 | 722 
$ 38 m d 23.6 | 38.0 | 51.0 | 70.0 
C. 425 in. 241 | 37.1 | 53.2 | 70.9 
D. 4.75 in. 232 | 37.8 | 53.1 | 70.5 
Sud AUR NN 26.0 | 39.4 | 53.1 | 70.5 
p. 3.75 in. 95:5 | 39.9 | 52.7 | 68.4 
C. 425 in. 262 | 29.3 | 52.3 | 68.4 
D. 4.75 in. 248 | 39.9 | 53.3 | 68.8 
study DT eun SES 23.7 | 37.0 | 53.9 | 70.8 
3 XEM IE IEJE: 
& Teal 24.0 | 38.7 | 57-4 | 74.3 
Study IV 109 | 20° | 30° | 40° 
4.15 in. 17.5 | 241 | 324 | 38.5 
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TABLE 2 


ANALYSIS OF VARIANCE FOR MEAN JUDGMENtS IN 10 Four Srepins ov Expr, | 


Study I Study II | Study III Study IV 
Source df z > 
MS F MS | | MS F 
Ss 15| 134.5 73.5 | 105.6 
Angles (A)| 3 (25,776.20 | 7.99**| 21,9790 792.6**| 29,829.0 |926.4** 
Forms (F)| 3 94.2 6.6 202.6, 19.3** 
SsxF 45| 238.8 6.8 | 10.6 | 
SsXA 45| 322.5 27.7 | 323 45 1.64 
AXF 9 13 6.8 | 30.2| 16.1** 
SsXAXF | 135 126.7 4.5 1.9 
Total 255 | | 63 
**5 «.01. 


The data obtained in the first two 
studies (restricted and full view of the 
four rectangles) are highly similar, 
indicating the use of similar cues in 
the two situations. Both analyses of 
variance obtained a significant F only 
for angles, with F values for rectangles 
and interaction of rectangles and 
angles falling short of significance. 


80 


o 
o 


MEAN SETTING 
A 
o 


(OBJECTIVE ANGLE) 


N 
o 


20 
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(The statistical design does not permit 
investigations of the other interactions 
nor of the triple interaction.) 

The graph relating apparent to 
objective slant for the 4.75-in. rec- 
tangle under full viewing conditions is 
shown in Fig. 1 and will be considered 
to represent the apparent-objective 
slant scale for rectangles, since data 
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Mean settings (obt.) in degrees of 4.75-in. rectangle for each of the four standard 
à ies under full view conditions, compared to veridical judgments (ver.). 
angles 
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ie the first two studies were close 
ied and showed no effect of 
b ES : Statistical comparisons of 
etn with veridical points yielded 
5.6 PU (15) values: 5.5 for 20°, 
arn! 60°, and 9.8 for 80° (p < .01 
Or all). These results indicate ap- 
Parent slant to be significantly less 
than objective (a positive constant 
error) at 20? and to be greater (a 
pe e constant error) at 60° and 

Where nonrectangular forms were 
used, in the third study, the results of 
the analysis of variance Table 2) 
showed a significant F for forms 
(5 < .01) and for Forms X Angles in- 
teraction (p < .01), as well as for 
angles (5 < .01). Although no differ- 
ences were found in the scales for the 
ellipse and the random shape, a clear- 
Cut difference in settings for the two 
trapezoids is evident from the graph 


s in degrees for the two trapezoids (C, D) 
compared to veridical judgments (ver.). 


in Fig. 2. Comparisons of obtained 
settings with veridical points found 
t (15) values for Trapezoid C of 27 
at 20° (p < .05), 3.8 at 60° (p < -01), 
and 5.7 at 80° (p < .01); for Trape- 
zoid D, obtained ¢ values were 4.2 
at 20? (p < .01), 2.0at 60? (p « .05), 
and 4.8 at 80? (p < .01). Further- 
more, a comparison of mean pooled 
judgments for the two trapezoids 
yielded at (30) = 8.6 (p < .01). This 
contrast in settings of the two trape- 
zoids lends some support to a Pragnanz 
view of shape constancy (Epstein & 
Park, 1963, p. 275). 

The fourth study demonstrated, in 
its extension of the scale toward zero, 
that the positive error found at 20° 
occurred also at other small angles. 
The resultant mean judgments for the 
one rectangle (4.75 in.) used are 
shown in Table 1, the analysis of 
variance in Table 2, and the apparent- 
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objective slant scale for these angles 
in Fig. 3. The analysis of variance 
found the / value for angles to be 
significant at better than the .01 level. 
Comparisons obtained with veridical 
judgments yielded the following ¢ (15) 
values: 15.0 for 10° (p < .01), 4.1 
for 20? (p < .01), and 2.7 for 30° 
(p < .05). 

Since no previous study has in- 
vestigated the relationship between 
apparent and objective slant over as 
full a range of angles, comparisons of 
the data must be made with studies 
using smaller segments of the arc. 
Studies by Clark and his associates 
(Clark, 1953; Clark, Smith, & Rabe, 
1955, 1956a, 1956b) have been di- 
rected toward identification of monoc- 
ular cues to slant and have found 
apparent slant to be less than objec- 
tive slant for angles of 53? (Clark, 
1953), of 0°, 20°, and 40° (Clark et 
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al, 1955), and of 40° (Clark et al, 
1956a, 1956b). In Smith's (1956) 
study of monocular judgments of six 
slants (10°, 20°, 30°, 40°, 50°, and 60°) 
perceived slants were found, in partial 
agreement with the present study, to 
be less than objective slant for all 
angles. 

Applying these data directly to the 
invariance hypothesis, one may pre- 
dict that, if constancy of shape is 
dependent upon accuracy of slant 
judgments, shape estimates for rec- 
tangles and ellipses should be most 
accurate from 35° to 40° (from 25° 
to 30° for Trapezoid C, 38° to 40° for 
Trapezoid D) and should fall off for 
smaller and for larger angles. Fur- 
ther, if the slant of an object is under- 
estimated (as for the smaller angles 
and for Trapezoid C), its perceived 
shape should be close to projected 
(retinal) shape. On the other hand, 


. ver 
e obt. 


10 


20 


30 40 
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Mean settings in degrees for the 4.75-in. rectangle at the four standard angles 
: in the fourth study, compared to veridical judgments. 
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TABLE 3 
MEAN WIDTH SETTINGS IN Ix. TO MATCH THE SLANTED Forms iN Expr. HI 
| 10° | 20° | c3 | 40° 60° a ‘ 
in. rectangle 486 | 493 | 4.79 37 
in. Trapezoid C | 505 190 | 491 131 133 EH 
in. Trapezoid D | — $26 5.13 | 5.20 4.96 4.62 Pre 
wellipse 303 | 510 | 500 4.95 478 hp 


where slant is overestimated (as for 
the larger angles and for Trapezoid 
D), perceived shape should be far from 
projected shape. 

, The relevant constancy measure 
into which these predictions might be 
transposed is given by the quantity 
a — p, where a is apparent shape and 
P is projected (retinal) shape. Using 
this formula as the basis for predic- 
tions, data should reveal an increase 
m this value with increasing angles. 
Furthermore, the quantity à — p 
should be small for Trapezoid C and 
large for Trapezoid D, in comparison 
With its value for rectangular forms. 


EXPERIMENT Il 


A. provide the data needed to 
Mies the predictions of shape as a 
sl lon of slant, based on the shape- 

ant invariance view, two studies 
Were undertaken measuring shape 
Judgments for the forms used in the 


previous experiment set at the same 
angles of inclination in the same 
apparatus. 


Method 


Apparatus and procedure.—Standard stim- 
uli (the 4.75-in. rectangle and the 5.00-in. 
trapezoids in the first study; the 5.00-in. 
ellipse in the second) were presented at 10°, 
20°, 30°, 40°, 60°, and 80° in the chamber al- 
ready described. Both studies presented com- 
parison stimuli in the frontal-parallel plane, 
but the methods of matching had to be some- 
what different, as dictated by the nature of the 
stimuli. 

For measuring constancy for the rectangle 
and the two trapezoids, a comparison stimulus 
was presented equal in height to the standard 
and of width that could be varied to match 
the width of the standard in accordance with 


the method of average error. More specific- 


ally, the apparatus consisted of a central 
le set immediately be- 


black Masonite rectang! 

hind and framed by a free-standing gray 
Masonite rectangle 16.25 X 21 in. This 
gray rectangle, equivalent in outside dimen- 
sions to the experimental chamber, was set 
at the same distance from S as was the 


TABLE 4 
ANALYSIS OF VARIANCE OF MEAN SETTINGS OBTAINED IN EXP. II 
Rectangle and Trapezoids Ellipse 
Source 
df MS F df MS F 
5s (Ss 43 14 25 
Angles (A) 5 15.96 3112** 5 1.30 7.22** 
orms (F) 2 1.82 4.23* E 
SXA 70 .60 70 48 
SXF 28 136 
ase 10 43 
DSXA > 
Po F] ds 89 
* p <05. 
**$ «c OL, 
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standard rectangle within the chamber. The 
black variable rectangle, centered within a 
cutout from the large gray rectangle, could be 
varied in width from 2.0 in. to 7.5 in. by 
moving on a pulley system a vertical piece 
equivalent in height and general appearance 
to the gray framing rectangle and 6.0 in. in 
width. For each figure at each angle, E 
varied the width of the comparison stimulus 
until S judged its width equal to that of the 
standard; two determinations, one ascending 
and one descending, were made for cach S for 
cach form at each angle. A randomized 
arrangement of the three forms and six 
angles was employed across Ss, 

In the second shape study, comparisons 
were made of the standard ellipse (11 X 5 in.) 
with variable ellipses all 11 in. in major axis 
and with minor axes varying in units of -25 in. 
from 2.0 to 6.0 in. The variable ellipses were 
presented one by one in randomized arrange- 
ment, in accordance with the method of 
constant stimuli, Two determinations were 
made for each S at each of the six angles of 
slant. Order of presentation of angles was 
varied across Ss, 

Subjects.—Thirty university undergradu- 
ates served as Ss, 15 in each study. All Ss 
had had previous experience in making the 
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judgments called for; in addition, half had 
served as Ss in the slant experiments. 


Results and Discussion 


Table 3 summarizes the data ob- 
tained in the two shape-judgment 
studies, and Table 4, the analyses of 
variance. [n both, the F values for 
angles were significant at the .01 level, 
and for the first study, the F value 
for forms reached significance at 
better than the .05 level. 

The relationship between apparent 
Shape and angle of inclination is 
shown by the graph in lig. 4 for the 
rectangle and in Fig. 5 for the two 
trapezoids and the ellipse. These 
graphs also plot object shape and 
projected shape, permitting estimates 
of the value a — b. All of these 
functions show similar relationships— 
everconstancy at 10? and 20°, near 
object matches at close to 30°, and 
matches falling away from projected 
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Apparent shape as a function of angle of inclination for the 4.75-in. 
Apps compared to object (obj.) and Projected (proj.) shape. 
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and the ellipse, compare 


path at the larger angles. For all 
ot the quantity a — $ may be 
im» by inspection to increase as a 
ction of angle, and Fig. 6 shows a 
Fa of this relationship. 
The data obtained for the ellig 
ta. be compared with results ob- 
me ed by Nellis (1958) in an experi- 
e line measuring shape judgments for 
Pses set at angles of 15°, 30°, 45°, 
diffe and 75°, Using a somewhat 
erent method, Nellis’ obtained 
“Nctions relating judgments to angle 
py PClination are similar to those im 
de 5; she found Ss making their 
to ant matches to object size at close 
the of slant, with overestimation at 
tion Smaller angles and underestima- 
Stug at larger angles, as 1n the present 
bug She (Nellis, 1958, p- 49) com- 
incr the value a/p and found an 
ĉase in this quantity for a 


pse 
m 


d to object (obj.) 


and projected (proj.) shape. 


ellipse from 1.06 at 15° to 2.82 at 18. 
Converting the present data into @/p 
values, there is a comparable increase 
for the 5.0-in. ellipse used from 1.02 at 


10° to 4.23 at 80°. 


GENERAL DISCUSSION 


Confronting the shape-constancy data 
obtained in the second experiment with 
the predictions based on the scales of 
apparent-objective slant from the first 
experiment, the data lend some support 
to the shape-slant invariance hypothesis. 
Predictions as to the locus of accuracy of 
shape matches were not confirmed with 
precision; closest match to object size 
was made for the rectangle and the 
ellipse at close to 30°; i.e., at a smaller 
setting than predicted. The two trape- 
zoids, equivalent to rectangles slanted in 
opposite directions, showed some con- 
trast in shape judgments, though not of 
as great magnitude as for slant estimates. 
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Fic. 6. Compensation as a function of angle of inclination for the rectangle, 
two trapezoids, and the ellipse. 


The data generally support the predic- 
tion that, along with underestimations of 
slant found at the smaller angles, matches 
should be close to projected shape, and 
that overestimations of slant at larger 
angles should result in settings progres- 
sively farther away from projected shape. 
What could not be predicted from the 
apparent-objective slant scale as such 
was the finding of overconstancy at 10° 
and at 20°. These angles were those at 
which Ss had underestimated slant, and 
it was predicted that shape judgments 
should be close to projected shape; since, 
however, at these small angles, projected 
shape is itself close to object shape, 
judgments must also be close to object 
shape. If we view S as aware of the 
inclination. of the form and therefore 
tending to compensate for the angle of 
slant, then his attempt to do this may 
£ e dis judgment, in retreating from 
caus cted shape, to go beyond object 
[e Nellis (1958) has applied this 


kind of explanation to her findings of 
overconstancy at smaller angles: 


Consider the larger angles of tilt where com- 
pensation is manifested by the fact that the 
match lies between projected image-size an 
object-size. If there is compensation wi 
small angles of tilt, then perceived size whic? 
is consistently greater than projected image: 
size also yields a match greater than actua 
size, since projected image-size is so close t? 
object size [p. 87]. 


This compensatory tendency, then, 
appears to be an effect additional to the 
shape-slant relationship and sometimes 
opposing it. The shape-at-a-slant ES 
fluence directs perception toward retin? 
shape; on the other hand, awareness o 
the slant itself may cause a movement © 
the perception toward the real object: 
For most angles of orientation, the result 
of these two processes is a compromise 
lying somewhere between retinal shape 
as determined by the shape-slant invari- 
ance relationship, and the object shape 
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as dictated by the compensatory tend- 
encies within S. In the case of the 
smaller angles, however, since the shape- 
slant effect i slight, the compensation 
seems dominant. 

This two-aspect view is, of course, in 
different terms what Koffka (1935) has 
asserted to be the case: 


According to our assumption a retinal shape 
sets up forces to produce a similar psycho- 
phy shape. These forces come into con- 
flict with the stress in the field when the plane 
in which the shape appears is nonnormal. 
Owing to this stress the retinal shape is 
changed into another shape more like the 
real one [p. 233]. 


The view seems similar also to one that 
Beck and Gibson (1955) have attributed 
to Helmholtz: 


A follower of Helmholtz . would argue 
that the organism discovers in the course of 
time that the cues for the slant of an object 
vary concomitantly with the retinal projected 
form of the object. Only then is he in a 
Position to infer the true shape of the object 
from the retinal shape by combining this 
information with the cues for slant Cp. 127]. 
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IN THE RATING OF GRAYS! 
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Mental Health Research Institute, University of Michigan 


Techniques for iterative experimentation, whereby the results of the 
1st experiment modify the conditions of the 2nd, and the results of the 
2nd in turn modify the conditions of the 3rd, etc. were examined for 


neutralization of the effects of stimulus bias in the psychophysic. 


of gray papers. 
sorting, magnitude selection, and 


al scaling 


The scaling methods of category estimation, category 
ideal selection were employed. The 


techniques were tested by deliberately introducing stimulus bias with 
sets of stimuli, sharply weighted in opposite directions, and determining 
whether the iterative procedure would yield a single empirical scaling 


from both sets of stimuli. 


In most instances, the techniques of iterative 


experimentation were effective in getting rid of the effects of stimulus 


bias quickly. Exceptions to the gen 
methods with extremely weighted 


stimuli. 


An experiment by Garner (1954) 
serves as an important reminder that 
the validity of an apparently straight- 
forward psychophysical experiment 
may be ruthlessly destroyed by seem- 
ingly innocent decisions made by E in 
the initial stages of investigation. 
Garner showed that the sound level 
required for half loudness could be en- 
tirely determined by the range of 
sound levels arbitrarily selected by E 
in the method of constants. Specif- 
ically, Garner exposed three groups of 
Ss to pairs of tones. The S was in- 
structed to indicate whether the sec- 
ond tone was more than, or less than, 
twice as loud as the first tone. The 
first member of the pair was constant 


1 This research was carried out at the 
Applied Psychology Research Unit, Cam- 
bridge, England, while the writer was a 
National Science Foundation Postdoctoral 
Research Fellow (1960-61). The writer is 
especially indebted to Donald Broadbent, 
Director, APRU, for providing a hospitable 
and encouraging research environment and to 
€ hristopher Poulton for long discussions and 
a 2 angements in testing Cambridge Uni- 
eatin Feats: The writer is indebted to 
Se Renee for discussions of early drafts 


of this paper. 


eral finding are sorting and selection 
or extremely narrow initial sets of 


at 90 db. The second member of the 
pair varied from 55 to 65 db. for one 
group, from 65 to 75 db. for another, 
and from 75 to 85 db. for a third group 
of Ss. "The mean and median levels 
required for half loudness were in- 
distinguishable from the midpoints ol 
the stimulus ranges. The means were: 
60 db. for the first group, 70 db. for 
the second, and 80 db. for the third. 
The final result, then, was indistin- 
guishable from a result completely 
determined by the stimuli arbitrarily 
chosen by E, . 

Stimulus bias will be said to be in 
evidence whenever the result of a 
psychophysical study is strongly de- 
termined by the particular stimuli 
arbitrarily selected by E. Garners 
experiment is an excellent, although 
extreme, demonstration of stimulus 
bias. 

The aim of the present paper is tO 
describe, and test, a set of procedures 
for getting rid of—or, more precisely, 
for neutralizing—stimulus bias in psy- 
chophysical experiments. Stated dif- 
lerently, we are concerned with pro- 
cedures which protect data from the 
seemingly innocent decisions made by 
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E in his arbitrary selection of stimuli 
preparatory to an experiment. And, 
since rating scales are particularly 
vulnerable to the effects of stimulus 
bias (J. C. Stevens, 1958), primary 
emphasis will be placed upon getting 
rid of stimulus bias in rating scales. 
The aim is, thus, in sympathy with 
recent studies in the “social psy- 
chology of the psychological experi- 
ment [ Rosenthal, 1963 ]." 

One set of procedures which may 
neutralize stimulus bias is iterative 
(Stevens, 1955). Iterative procedures 
—in which the results of the first 
study modify the conditions for the 
Second, the results of which, in turn, 
modify the conditions for the third, 
€tc.—approach a final result through 
a series of successive approximations. 
On several Occasions, Stevens and his 
Co-workers (Poulton & Stevens, 1955; 
Jr C Stevens, 1958; Stevens, 1955, 
1957; Stevens & Galanter, 1957; 
Stevens & Poulton, 1956; Stevens, 
Rogers, & Herrnstein, 1955) have 
Pointed out the potential of iterative 
experimental procedures for obtaining 
Unbiased psychophysical functions, al- 
though they rarely carried through 

€ iterative procedures in a series of 
Systematic investigations. One ex- 
Ception was the use of an iterative 
Procedure for the unbiased averaging 
9i data (Stevens, 1955). More re- 
“ently, Smith (1961) has advocated 
a Use of iterative procedures for 

mulus programing in order to in- 
Crease the efficiency of psychophysical 
“Xperimentation. An excellent ex- 
“mple of Smith's thesis is the Bekesy 
(i nique, as applied to human 
i irsh, 1962) and to animal (Blough, 
) threshold testing. 
aig V hile major efforts are presently 
teg nd toward the isolation of con- 
nene factors in psychological experi- 
loss ation (Helson, 1964; Johnson, 


i Parducci, 1963; Rosenthal, 
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1963), the essential mode of operation 
of stimulus bias was described by 
Stevens and Galanter (1957). Typ- 
ically, S expects to employ all of the 
available ratings in a rating experi- 
ment equally often (Arons & Irwin, 
1932). When sets of stimuli are em- 
ployed which substantially unbalance 
the equal frequency of ratings, S must 
wrestle with a conflict between per- 
forming the rating task at hand and 
his expectations of a uniform distribu- 
tion of ratings. One method for 
avoiding this confiict is to tailor the 
stimuli, through iterative procedures, 
to conform with S's expectations of a 
nearly uniform distribution of response 
categories. The following techniques 
attempt to provide a more nearly uni- 
form distribution of response cate- 
gories, and thereby, achieve rating 
scales which are relatively independ- 
ent of the initial stimuli arbitrarily 
selected by E. 


ITERATIVE TECHNIQUES 


Technique of adjusted spacing.—This 
technique leads to additional stimuli in 
the stimulus region where sensitivity is 
high, i.e., where the slope of the category 
function is steep; and to fewer stimuli in 
the stimulus region where sensitivity is 
low, i.e., where the slope of the category 
function is shallow. This technique, 
along with the other techniques, makes 
the weak assumption that stimulus ob- 
jects may be arranged to provide a 
monotonic category scaling. The tech- 
nique also assumes that: intermediate 
stimuli may be interpolated along the 
stimulus scale; and, an appropriate spac- 
ing of stimuli corresponds with equal 
differences along the rating scale. The 
technique has been formally outlined by 
Stevens and Galanter (1957, pp. 381- 
382). 

Figure 1 presents a hypothetical cate- 
gory estimation. The ordinate is the 
mean category rating. The abscissa is 
the stimulus scale between a lower and 
upper limit. This scale may represent a 
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Fic, 1. Illustration of the application of the technique of adjusted spacing 


for a hypothetical category estimation. 


variable with equal interval or equal 
ratio properties, a variable with only 
ordinal properties, or a nominal set of 
stimuli that have been arranged only 
after the initial scaling to provide a 
smooth monotonic function. With the 
upper and lower limits of the range fixed, 
the following steps are carried out to ob- 
tain new stimuli between the limits: 


1. Plot the initial category scaling. 

2. Determine the difference between 
the mean category ratings assigned to the 
upper and lower stimulus limits. 

3. Divide the difference by (» — 1), 
where n is the number of stimuli desired 
for iterative scaling. 

4. Add the quotient of Step 3 succes- 
sively to the mean category rating as- 
signed to the lower limit until the rating 
assigned to the upper limit is reached. 

5. Draw a line parallel to the abscissa 
from each of the values of Step 4 to the 

ve. 
Ears perpendicular from the 


intersection to the abscissa. 


(See text for details.) 


7. Read the stimulus values on the 
abscissa scale. 


T'echnique of average rating.— This tech- 
nique attempts to achieve quickly the 
same end as the technique of adjusted 
spacing, when two or more sets of stimuli 
have been employed with common limits. 
The technique assumes that the average 
of two category ratings is less deviant 
from an unbiased rating than the more 
deviant rating alone. The following 
steps are carried out: 


1. Plot the initial category estimations 
upon a single graph. 

2. Average the ratings at correspond- 
ing stimuli. 

3. Obtain a new set of stimuli by the 
method of adjusted spacing. 


Technique of unequal stimulus fre- 
quencies.—This technique attempts to 
achieve an unbiased function when addi- 
tional stimuli cannot be selected. In- 
stead of adjusting the spacing, the tech- 
nique adjusts the frequency of occurrence 
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of a fixed set of stimuli. The technique 
assumes that an appropriate frequency of 
stimulus presentation corresponds with 
equal differences upon the rating scale. 
The following steps are carried out: 


1. From the initial category scaling, 
note the scale rating for each stimulus in 
terms of a monotonic category function. 

2. For any stimulus s, calculate the 
average difference between the ratings 
igned to s and its adjacent stimuli, or 
to the adjacent stimulus for the upper 
and lower stimulus limits. 

Nd the relative frequency. of 
à nce of each stimulus by the 
average differences in weightings. For 
example, if four succ ve stimuli are 
associated with mean ratings of 1, 2, 5. 
on d ie four stimuli would be pre- 

ed with relative frequencies of 1, 2, 


DS i : 
2.5, and 2, respectively. 


E 3 of equal-res ponse frequencies. 
a ee attempts to achieve the 
i m as the technique of adjusted 
appro a The technique assumes that an 
snide ars spacing of stimuli corre- 
i sequal spacings along the cu- 
techni ESPANSE frequency scale. The 
illust que, and related techniques, are 
T rated in Pollack (1964). 
i of single-stimulus presenta- 
problen ns technique logically avoids the 
only a ho stimulus bias by presenting 
Slot E stimulus to each S or by 
Sin ee only the fir response of each 
is an extended series of judgments. It 
techni empirical question whether the 
unction, yields the same empirical 
techni n as the other techniques. This 
didm has been widely employed in 
: Route. scaling (Garner, 1958; Stevens 
ton, 1956). 

Tie ae of category production 
sonable» gaue attempts to yield a e 
Phe S ; distribution of stimuli quickly. 
given, instructed to produce, or select, 
to the t aibe of stimuli, 5, 1 addition 
Sive that succes- 
appear to be b me 

?sired ek (s +2)=%, the num ber at 
Sar imuli, no further step !$ neces 
the ` Tf (s + 2) #0, interpolation of 
Stimulus scale is required. The 


a 


method wa 
(Stevens & 


suggested by G. Stone 
lanter, 1957, pp. 392-393). 


METHOD 


I hh mete em 
29 grays from whi cl oe gt paceigulcl 
‘ays ch sets of 11 grays were 
selected. ‘The reflectances are only approxi- 
mate because of differences in reflectance of 
different samples of the same gray paper after 
repeated use and handling. The qualified 
specification of the stimulus sets is not re- 
garded as serious since only the rank ordering 
of the stimulus set, rather than the reflect- 
ances, was used. Moreover, by concentrating 
upon the order property of the sets of grays, 
we were able to explore the applicability of 
the techniques to "weak" stimulus metrics on 
ordinal scales. 

‘Table 1 also presents the specilic 11 grays 
employed as the initial sets in the various 
tests. For example, Sets 1, J, K each repre- 
sent a wide range of grays with intermediate 
overlapping grays. Sets L, M, N each 
represent a wide range of grays without inter- 
mediate overlap. Sets O, P each represent a 
medium range of gra Sets Q, R, S each 
represent subpopulations in which some 
stimuli are unavailable (u in Table 1). And, 
Sets T-N each represent. narrow ranges of 


ch set of grays may be represented 
1 double rank ordering, as 
‘The abscissa is the rank- 
1 (white) to 29 (black) ; 
the ordinate represents successive stimuli 
from Stimulus 1 to Stimulus 11. In Fig. 2F, 
for example, the open circles represent a set 
of predominantly light grays with a scattering 
of dark grays; the filled squares represent à 
set of predominantly dark grays with a 
scattering of lighter grays; and, the half-filled 
triangles represent a set of predominantly 
intermediate grays with a scattering of light 
and dark grays. 

Experimental methods.—In the method of 
category rating or category estimation (S. $; 
Stevens, 1958, Table 2), S was instructed to 
assign a number from 1 to 7 to grays pre- 
sented one at a time. Paper 1 (white) was 
shown as an example of the lower end of 
Category 1; and Paper 29 (black) was shown 
as an example of the upper end of Category 
7. The S was told that equal brightness 
differences were to be reflected in equal 

The S was also told 


numerical differences. 
that some of the ratings might be used more 
ome might not be 


often than others, that si 


graphically b 
shown in Fig. 2 
order darkness from 
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TABLE 1 
SUBSETS OF 11 GRAYS FROM AVAILABLE POPULATION OF 29 Grays 
PR. | Reflect. I J RL M N POR S T U v w x 
1 72 * * * * + * * E » a 
2 | 66 * * io 
3 | 64 * * ic à 
4 | 58 u 2 
5. | 58 "b 1E X g 
6 | 47 E dE Ge * u »* * 
7 | 43 j * * * k 4 
9 41 * 
9 | 36 * No. M uw à 
10 32 * 
11 | 31 * T d 
12 | 30 * os ox . * i o * + 
13 | 30 * * * 
14 | 24 
15 | 21 * * s 9 P u d 
i7 | it : : ou y : 
* * 
18 | 13 * * * * u a * « 
1 13 * * * #* 
12 * * 
21 | 10 * eA Ww" un i 
22 9.2 
23 | 74 . * 4a Rod 
24 2 * w # * u os 
25 5.7 * 
26 | 54 . s po : 
27 5.2 * * * 
28 4.2 * * 
29 | 35 * * & x € y * 8 . 


Note.—u = unavailable in the subpopulation. 


used, and that the initial white and black 
papers would be included within the stimulus 
series. 

In the method of category Sorting, a set of 
11 grays was successively arranged from 
white to black, or from black to white, The $ 
was instructed to place the grays into seven, 
or fewer, piles so that successive piles repre- 
sent approximately equal differences | in 
brightness. . 5 

In the method of magnitude selection, a set 
of 11 grays was haphazardly arranged upon 
a white background. The S was instructed 
to select that paper which was most nearly 
half as bright (or n - m the = 

E nd. In other tests, the grays were 
rd upon a black background and S 


was instructed to select that paper which was 
most nearly twice as bright (or half as dark) 
as the black background. Since there were 
no consistent differences between judgments 
under brightness or darkness instructions, 
results were pooled over instructions. 

In the method of ideal selection (Coombs, 
1958), a set of 11 papers was haphazardly 
arranged upon a white background. The S 
Was instructed to select that paper which most 
nearly corresponds to “ideal” gray. 

xperimental details —Table 2 summarizes 
the experimental details for the separate 
experiments, The first column lists the 
graphical location of the results, with special 
reference to the double-rank representation of 
the stimuli employed. The second column 
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TION 4 
(4) 


MEAN CATEGORY 


RANK-ORDER DARKNESS 


2. Application of the technique of equal-response frequencies to magnitude selection 


(r 'g. 2A, B, and C); and application of the technique of adjusted spacing to category estima- 


tion (Fig. 3 


Tepresent, 


lists the in 
Present 
Column 


initial sets of stimuli upon the first 
ation, as coded in Table 1. The third 
lists the number of initial sets of 


D and E) for a wide range of gray papers (Sets I, J, K). 
ation of the three initial sets of stimuli is presented in Fig. 2F.) 


TABLE 2 


SUMMARIZATION OF EXPERIMENTAL DETAILS 


(The double 


stimuli to which S was exposed. 
column lists the number of Ss represented by 
each datum point, and whether Ss were 


rank-order 


The fourth 


Fig, g Sets No. of Sets Ss Iter./S Exper. Proc. | Iter, Tech. 
2p - 1 30U 1 MS ER 
hh K CE AS 
3r : 2 11 R 3 cs AS 
ia 12R 3 CE AS 
4 O.P m 10-17 R m CE AS 
SE Q,R,S m 14-20 R m CE AS 
6 L,M,N m 12-18 R m CE US 
7 D dé m 12-18 R m Cs AS 
CP 
CE AS 
8 all 29 — |14R 1 CP GP 
9 ae — 25 U 1 CE SS 
10 -x 1 6-7 R 5 MS AS 
Ee L.X 1 15 U 1 IS ER 


N 
9'6.—m = mixed for different Ss. 
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British Naval Ratings (R) or Cambridge 
University students (U). The fifth column 
lists the number of successive iterations to 
which each S was exposed, where 1 indicates 
that separate Ss were employed for each 
iteration. The sixth column lists the experi- 
mental procedure: magnitude selection (MS), 
category estimation (CE), category selection 
(CS), category production (CP), or ideal 
selection (IS). The seventh column lists the 
iterative technique employed: equal-response 
frequencies (ER), adjusted spacing (AS), 
unequal stimulus frequencies (US), category 
production (CP), and single-stimulus pres- 
entations (SS). 

All testing was performed individually. 
The number of rating categories in the 
category-estimation and sorting tests was T? 
only a single selection was made in the 
magnitude and ideal selection tests. When 
successive iterations were carried out by the 
same S, revised stimuli were based upon the 
individual S's responses, Within each test 
series, the illuminating conditions and back- 
ground reflectance were constant. For ideal 
selection tests: the illuminating condition was 
outdoor, daylight, and the background re- 
flectance was 68%. Other tests with Cam- 
bridge University students employed night 
room light and diffuse lamp light. Tests with 
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British Naval Ratings employed spot illumi- 
nation with the bulb about 15 in. over the 
test surface. Magnitude-sclection tests em- 
ployed a white background (Paper No. 1) or 
a black background (Paper No. 29). Cate- 
gory-estimation and category-sclection tests 
employed a background reflectance of 68%. 
Experimental strategy —The basic strategy 
of experimentation was to deliberately intro- 
duce stimulus bias, by employing two or more 
weighted sets of stimu ;and then to attempt 
to remove, or neutralize, the stimulus by 
application of an iterative procedure. The 
application was considered successful when 
the same rating scale was obtained, despite 
large differences among the initial sets of 
stimuli. For example, three sets of grays with 
no intermediate overlap are represented in 
Fig. 3F. The three sets of grays led to sub- 
stantially different. initial category estima- 
tions, plotted as CE (1) in Fig. 3D where the 
three shapes reflect the three sets of stimuli of 
Fig. 3F. Application of the iterative technique 
of adjusted spacing upon the results of Fig. 3D 
led to three new sets of stimuli. The category 
estimations associated with the new sets are 
presented in Fig. 3E as CE (2). A single 
category estimation adequately describes the 
results of all sets. A further iteration, CE (3) 
in the form Xs, demonstrates that the cate- 
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Application of the technique of adjusted spacing for the category 


estimation of a restricted range of gray papers. 


ad uim is stable upon further itera- 
dess ib his totality of operations will be 
E ed as the convergence upon a stable 

gory estimation. By contrast, under the 
Procedures illustrated in Fig. 3A, B, G, 
Uccessful convergence was not achieved 
Within five successive iterations. 


RESULTS 


The main findings may be briefly 
Summarized: convergence upon a 
Stable empirical function is quickly 
obtained by all techniques under al- 
Most all of the conditions investigated. 

Onvergence may be substantially 
Slower where the following restrictions 
are simultaneously applied: a highly 
Weighted set of stimuli (Sets L, M, N) 
d narrow set of stimuli (Sets T 

Tough X); an experimental method 


which implicitly or explicitly imposes 
strong restraints upon S (sorting or 
selection) ; and, the application of the 
iterative technique to the individual 
S's results, rather than application to 
the average S's result. These condi- 
tions apply to the results of Fig. 
3A-C and Fig. 10. 

'The overview may be made more 
complete by the following additional 
details: 

Sample results for additional vari- 
ables are represented in Fig. 6 and 8. 
The initial category estimations for 
these variables were similar to those 
of Fig. 2A and 3D. 

In Fig. 2C, 3E, 4, aud 7, Xs repre- 
sent the results of an additional 
iteration which tested the stability of 
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the resulting empirical function. In 
additional tests, rapid convergence of 
the category estimations under the 
conditions of Fig. 2 was obtained 
whether separate groups were em- 
ployed for each iteration of each stim- 
ulus set, or whether a single group of 
Ss was followed over successive itera- 
tions. In additional tests, the slow 
rate of convergence of the category- 
sorting tests in Fig. 3A, B, and C was 
hastened by simply applying the 
technique of adjusted spacing to the 
pooled sortings across Ss, rather than 
to the individual data alone. 

The results of Fig. 4 suggest that 
well-defined stimulus end points, such 
as black and white, are not necessary 
for rapid convergence upon a stable 
category estimation. 

The rapid convergence of the cate- 
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gory estimations for each of the initial 
subpopulations of 21 papers of Fig. 
5A, B, C is not unexpected. More 
important when the third category 
estimations are plotted against the 
common scale of 29 papers, a common 
category estimation is achieved across 
subpopulations (Fig. 5D). The re- 
sults suggest that the category estima- 
tion associated with a given stimulus, 
resulting. from iterative techniques, 
will not be critically determined by 
the arbitrarily defined | population 
from which the set of stimuli is drawn. 
The population must only provide 
stimuli for interpolation to the tech- 
nique of adjusted spacing in regions 
of high discriminability. 

Unlike the other inserts which 
represent initial stimulus sets, the in- 
sert of Fig. 6 represents the distribu- 
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Application of the technique of ad 
apers within different initial subpopulations, 
distributions of Fig. 5E are presented in Fig. 


justed spacing for the category estimation of 
(The initial category estimations of the 
5A, B, and C in terms of cach subpopulation. 


» and C is plotted in Fig. 5D against the common 
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Application of the technique of single-stimulus presentation 


to the category estimation of gray papers, 


tion of unequal stimulus frequencies 
following the initial category estima- 
tions. In the case illustrated, two 
nonoverlapping sets of marker posi- 
tions were employed. 

The tests represented in Fig. 7 sug- 
gest that rapid convergence upon a 
stable category sorting (CS) is pos- 
sible if the initial stimulus sets are 
partially overlapping, rather than 
nonoverlapping as in Fig. 3. Category 
production (CP) from among the 
initially weighted Sets LRE yielded 
sets of stimuli which led directly to 
a stable category estimation (CE), 
based upon the previous category 
productions. : 

Category productions that were ob- 
tained with examples from the entire 
stimulus range led directly to stable 
category estimations for a wide range 
of variables (Fig. 8). : 

The ratings obtained with the tech- 
nique of single-stimulus erue eee 
(Fig. 9) are in fair agreement with the 


mean category estimations after sev- 
eral iterations with a set of 11 grays. 
"There was an apparent tendency for S 
to withhold extreme ratings to the 
initial stimulus Presentation. If the 
initial ratings are adjusted to the 
extremes of the rating scale, closer 
agreement is obtained between the 
two techniques. Comparison between 
the techniques is made difficult by 
differences in category estimations 
furnished by the first and second runs 
through the set of 11 grays. 
Convergence is slow when S must 
make selections from a narrow stim- 
ulus range (Fig. 10). The slow rate of 
convergence must be viewed in light 
of the severe restrictions imposed by 
the narrow stimulus range. A portion 
of the range of grays (indicated by the 
stippled area of Fig. 10) was unattain- 
able, even after five successive extreme 
selections. For example, if the set of 
Seven grays were centered on Paper 26 
for the first selection, Paper 10 was 
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still not attainable after five successive 
selections. The reference line in Fig. 
10 indicates the result expected had 
the initial sets completely determined 
the final result. 

The median selections for “ideal” 
gray were Papers 11, 12, and 12.5 for 
Sets 1, J, K, respectively. Presenta- 
tion of a new set of stimuli, obtained 
by the technique of equal-response 
frequencies from the initial ideal 
Selections to Set 1, yielded a median 
selection at Paper 12. The results 
Suggest that convergence upon a 
Stable ideal selection may be quickly 
achieved by iterative procedures. 
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sep ttainable conditions even after five success 
€n. Within each vertical line of points, the 


the left; the black background [Paper No. 29] on the right. 


Dr 


At first, the several iterative tech- 
niques appear to act in a magical way to 
fashion stable rating scales. There is 
nothing magical about their action. In 
the technique of adjusted spacing, for 
example, new stimuli are obtained on 
successive iterations. The rating scales 
become more and more alike on succes- 
sive iterations because the stimuli become 
more and more alike. Perhaps the only 
requirement for convergence upon a 
stable rating scale is that S's responses 
reflect something more than the rank- 
order properties of the stimulus set alone. 

The failure of the category-sorting 
procedure with a wide range of nonover- 


16 
INITIAL RANK-ORDER DARKNESS 


Application of a restricted form of the technique of adjusted spacing to the 


[9] 8 24 


white background [Paper No. 1] are presented 


Papers 4, 9, 15, 21, and 26, respectively. The 


ameter is the number of successive selections. The shaded areas represent the range of 


ive selections of the extreme gray of each set of 
median of the individual selections is presented.) 


Seven successive papers were 
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lapping grays probably relates to the 
rank-order constraints imposed by the 
category-sorting procedure. P In order to 
investigate this point, additional tests 
were carried out with sets of 11 cards 
marked with the numerals 1, 3 sg 3 99, 
according to Sets L, M, and N. The Ss 
(Royal Naval Ratings) were instructed 
to sort the cards into seven, or fewer, 
piles so that the “average numerical 
distance from pile to pile was approxi- 
mately constant." The instructions 
were apparently disregarded because 
large gaps in the numerical ratings were 
not reflected in the sortings. A typical 
result for sorting the numerals of Set L 
into successive piles was: 1 & 2; 3; 
4&5;6;7 &8;9 & 10; 20. That is, S 
interpreted the instructions in such a way 
as to produce seven, and no fewer, piles 
whether dealing with grays or numerals. 
The agreement between the rank-order 
brightness and the rank-order numerical 
designation was high. Over 75% of the 
sorting categories agreed exactly for Sets 
L, M, N and over 65% of the sorting 
categories agreed exactly for Sets I, J, KK 
In less than 2% of the cases did numerical 
and brightness-category ratings differ by 
more than one category rating. This 
result suggests that the category-sorting 
method closely reflects the rank ordering 
of the stimuli. It may be noted that 
even with the strong restrictions of the 
category-sorting procedure, convergence 
upon a stable rating function may be 
achieved if the initial sets of stimuli are 
overlapping (CS tests of Fig. 7), or if 
successive iterations are based upon the 
pooled responses of several Ss rather than 
upon the responses of individual Ss. 

The minor difficulties with the excep- 
tional cases, however, should not lead us 
from the main result: rapid convergence 
upon an unbiased scale is achieved by a 
wide range of iterative techniques, for a 
wide range of stimulus variables and for 
a wide range of psychophysical methods. 
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REWARD VERSUS NONREWARD IN A SIMULTANEOUS 
DISCRIMINATION ! ` 


R. ALLEN GARDNER 


? AND. W. B. COATE °? 


Wellesley College 


The procedure of a 2-choice discrimination was varied in 2 wavs. 
Under M+ conditions, 60 rats were trained on problems in which S+- 
varied from trial to trial while the same S— was presented on every 


trial. Under M— 


conditions, 60 additional rats were trained on 


problems in which S— varied from trial to trial while the same S+ was 


presented on every trial. 


Performance was poorer under the M -+ condi- 


tion than under the M — condition, indicating that reward for choices 
of a specific S+ was a more critical determinant of discriminative per- 
formance than was nonreward for choices of a specific S—. 


In a standard procedure of simul- 
taneous-discrimination training the 
Same two discriminanda are presented 
9n every trial, and choices of S+ are 
Tewarded while choices of S— are not 
rewarded. Discrimination learning is 
Measured by increases in R+, the 
Probability of choosing S+, or, sym- 
Metrically, by decreases in R —, the 
probability, of choosing S—. With 
eid n discriminanda and two re- 
tiva of Categories, however, the rela- 
sini ects of reward for choices of Sc 
ngos LUCA for choices of S— 
o e evaluated because increases 
With d are completely confounded 

€creases in R —. 

€ the present experiment, two 
Were Ms of the standard procedure 
ple ot One variant was a Mul- 
Which oe procedure (M+) Us 
Were A erent positive discriminan a 
the & Presented on different trials, but 
trial ame S— was presented on every 

: The second variant was a 
in “\tiple-Negative procedure . (M =) 
ha uch different negative discrimi- 

a were presented on different 
Ma This research was supported by Grant 
from the National Institute of Mental 
» United States Public Health Service. 
a ial at the University of Nevada. i 
urh at Hazleton Laboratories, Falls 

» Virginia. 


He, 
a 
2 lth 
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trials, but the same S+ was presented 
on every trial. Both M+ and M— 
should be more difficult than the 
standard procedure in which the same 
S+ and the same S— are presented 
on every trial. But, of the two mul- 
tiple procedures, which should be the 
more difficult? In M+, additional 
diffculty is introduced via the condi- 
tions of reward training; in M—, via 
the conditions of nonreward training. 
We reasoned that the relative diff- 
culty of M+ and M— should reflect 
the relative importance of reward 
training and nonreward training, if 
the two multiple procedures could be 
equated in all other respects. Accord- 
ingly, the following experiment was 
designed to compare the M+ and 
M — procedures under a specific set of 
experimental conditions. 


METHOD 


Subjects.— The Ss were 120 male albino 
rats of the Sprague-Dawley strain, 90-120 
days old, purchased from the Charles River 
Breeding Laboratories. 

Apparatus.—The experimental compart- 
ment used for Discrimination Training is 
shown in Fig. 1. It consisted of a choice 
chamber and a treadle chamber connected 
by a narrow passageway and covered by a lid. 
The choice chamber was divided in half by a 
partition which was attached to the front 
wall. Two 2.25-in. circular openings (P) were 
cut out of the front wall of the choice chamber 
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Fic. 1. Diagram of the experiment 


on either side of the Partition, Each opening 
was covered by a translucent Plexiglas panel, 
hinged at the top so that it could be pushed 
outwards to actuate a microswitch. The two 
panels also served as screens on which the 
discriminanda were projected from behind by 
an automatic filmstrip Projector (not shown). 
When the panels were dark, both could be 
lighted by momentarily depressing the treadle 
(T), and both were darkened again by the 
first push on either panel. A correct choice 
was rewarded with a bead of water, delivered 
automatically by a solenoid operated dipper 
through an aperture (D) in the floor of the 
choice chamber. Actuation of the dipper 
solenoid was accompanied by a flash of light 
on the correct panel, and a cutout in the 
partition permitted § to take rewards from 
either side. 

A second compartment, used for Prelimi- 
nary Training, had the same floor plan as the 
compartment shown in Fig. 1. The front wall 
of the Preliminary Training compartment 
could be removed and replaced by a front 
wall with only one circular opening and panel 
in its center. The panels used in Preliminary 
Training were lighted by lamps from outside 
the compartment, and when lighted, these 
panels presented a homogeneous white field, 

The experimental rooms were maintained 
in darkness with a sound screen of white 
noise, and the automatic control and record- 
ing apparatus were kept ina separati room. 

Deprivation.—Dry food was available in 

living cages at all times, For 6 days 
the living inning of Preliminary Training 
before the beginning y 
E ; was placed on a 23-hr. schedule 
EIU. ‘ rivation. During Preliminary 
of walter 4 piserimination Training each .S 
1 raming an m cess fo water in his living 
was given free acces: 


al compartment used for Discrimination Training. 


cage for 45 min. each day, and wi 
deprived of water at the start of ed 
training session. 

Fialinigeny Training.—Each S was gs 
dled once daily on each of 4 days before te 
beginning of Preliminary Training. E do 
nary Training consisted of five, 35-min., E do 
sessions of training which were designec 
approximate, by stages, the procedure 2 
Discrimination Training. During Days 1-3, 
the single-paneled front wall of the choice 
chamber was in Place. On Day 1, S e 
confined to the choice chamber by a block 
placed at B in Fig. 1. The single panel n 
mained lighted throughout the session and à i 
panel pushes were rewarded. On Days 2 anc 
3, the block was removed and .S was free to 
move through the passageway between the 
choice chamber and the treadle chamber. 
When the panel was pushed while it wa 
lighted, a reward was delivered and the pane 
was immediately darkened, ‘The panel was 
relighted when S5 left the choice chambi 
entered the treadle chamber, and dep 
the treadle. On Days 4 and 5, the single 
paneled front wall was removed and replace à 
by the double-paneled wall with its partitior 
as shown in Fig, 1, Now, when S ey 
the treadle, one panel was lighted and Ld 
other remained dark. Pushing the lights 
Panel was rewarded, pushing the dark pane 
was not rewarded, and pushing either pane! 
darkened both. 32 

iscrimination Training.—There were 432 
trials of Discrimination Training, 48 on the 
first day and 96 on each of 4 succeeding days. 
The discriminanda were alternating black 
and white stripes, two black and three white 
and each stripe was js in. wide at the panel. 
The discriminanda differed only in the angular 
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orientation of the stripes, and four angles 
were used: 0°, 4: 20°, and 135°. One of the 
angles (Key Angle) was projected on one 
panel, and one of the three remaining angles 
(Alternative Angle) was projected on the 
opposite panel during each trial. Each Alter- 
native Angle was presented on one third of 
the trials, the sequence of presentation and 
left/right placement of the Alternative Angles 
being counterbalanced and evenly distributed 
over each block of 48 trials. 
xperimental design.—Malf of the 120 Ss 
were trained under the Multiple-Positive 
(M-+-) condition, in which all choices of the 
s were rewarded and no 
y Angle were rewarded. The 
Other half of the Ss were trained under the 
Multiple-Negativ e (M —) condition, in which 
all choices of the Key Angle were rewarded 
and no choices of the Alternative Angles were 
rewarded. Each of the four angles was used 
as the Key Angle for an equal number of Ss 
în both conditions. Fifteen Ss were randomly 
assigned to each of the resulting eight sub- 
Rroups in such a way that the time of day and 
Week of the experiment during which they 


Were trained were roughly equated. 

Thus, each M + animal had, in effect, 
three discrimination problems to solve con- 
currently, cach with a dillerent S+ but all 
three With the same S—; while, subgroup for 
Subgroup, each M — animal had three equiv- 
alent Problems, all with the same S+ but each 
With a different S—, 


RESULTS 


During the last 48 trials of Dis- 
“mination Training (Trials 385-432) 
the median number of errors was 10.0 
lor the M-+ group and 3.5 for the M— 
group. By the median test (Siegel, 
1956, pp. 111-116) this difference 
Yielded x? (1) = 5.63, p < .025.* Per- 
Ormance also depended upon which 
Angle served as the Key Angle, x? (3) 
7 33.1, p < .01; the discrimination 
Problems were more difficult when the 
Key Angle was 45° or 135° than when 
it was 0° or 90°. There was less 
difference between the M+ and M— 
Conditions when the Key Angle was 

° or 135°, but this was confounded 


d * Analysis of variance performed on these 
ata yielded an F (1, 112) of 6.11, p < .025. 
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by the fact that poorer performance 
during the last 48 trials was also 
closer to chance and hence less reliable. 

Although training was terminated 
after 432 trials, the results were also 
tabulated in terms of trials to cri- 
terion. For each S, the number of 
trials required to make a given 
number of correct choices out of any 
48 consecutive trials was tabulated 
for each criterion from 25,48 to the 
best performance reached within the 
allotted 432 trials. The median num- 
ber of trials required for each criterion 
from 25/48 to 45/485 was greater for 
the M+ group than for the M— 
group, and this difference was signifi- 
cant at the .05 level or better for each 
criterion from 38,48 to 44.48 by the 
median test. Since the criterion of 
44/48 was reached by half of the Ss 
in the total sample, we can say that 
the difference would have been at 
least as significant if all 120 Ss had 
been run to a criterion of 44 48. 


DISCUSSION 


Assuming that reward for each choice 
of a particular S+ increases the proba- 
bility of choosing that S+ more than it 
increases the probability of choosing any 
other stimulus object, the optimal stim- 
ulus condition for reward training would 
be one in which precisely the same S+ 
were presented on every trial so that all 
rewards would be received for choices of 
that S+. Thus, the effectiveness of the 
rewarded trials was reduced by the M + 
procedure because different positive dis- 
criminanda were presented on different 
trials. Similarly, the effectiveness of the 
nonrewarded trials was reduced by the 
M — procedure because different negative 
discriminanda were presented on differ- 
ent trials. In all other respects, the M+ 
and M-— discrimination problems were 
equated. 


545/48 was the highest criterion that was 
reached or exceeded by half of the M — group. 
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Under the conditions of the present 
experiment, poorer performance was 
found under the M+ procedure than 
under the M— procedure. This finding 
is consistent with models of discrimina- 
tion learning which assign greater weight 
to the effects of reinforcement than to the 
effects of extinction (Estes, 1959; Spence, 
1936); but it is not consistent with Har- 
low's (1959) proposition that discrimina- 
tion learning can be described as a 
"uniprocess" consisting of inhibitory 
learning, or with Amsel's (1962) conclu- 
sion that "avoidance of nonreward is a 
more powerful factor in discrimination 
than approach to reward [p. 309].” 
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EFFECTS OF FOOD AND WATER DEPRIVATION 
ON THE PERFORMANCE OF A RESPONSE 
MOTIVATED BY ACQUIRED FEAR? 


RONALD LEY? 


Syracuse University 


On the basis of Hull's drive-summation hypothesis, it was predicted 
that a fear-motivated response would be augmented by the addition of 
an irrelevant appetitional need or a combination of irrelevant needs and 
that the level of augmentation would be a direct function of the level of 
deprivation. 84 rats were fear conditioned, after which they were put 


on a 23-hr. food- and water-deprivation schedule. 


On the 7th and Sth 


days following this, Ss were given 25 trials of fear-motivated hurdle 
jumping under 6 different combinations of food and water deprivation. 
Median reciprocals of latency measures indicated that the rank order of 
performance on the 2nd day of testing and on the last block of 5 trials 
was consistent with Hull's drive-summation hypothesis. 


The trends of the results of studies 
designed to measure the effects of 
irrelevant needs suggest that needs 
combine to increase motivation as 
measured by changes in performance 
(Amsel, 1950; Meryman, 1952). Such 
exceptions to this generalization as do 
occur seem to involve the combination 
of hunger and thirst (Bolles, 1961; 
Kendler, 1945; Verplanck & Hayes, 
1953). Examination of these studies 
reveals that the response measure em- 
Ployed in each instance was either a 
consumatory response or an instru- 
mental response learned on the basis 
of one of these needs. Since it seems 
reasonable to assume that hunger and 
thirst are not independent, it is con- 
Ceivable that the failure to find a 
Motivational effect is the result of its 

‘This paper is based upon a dissertation 
Submitted to the Graduate School of Syracuse 

Niversity in partial fulfillment of the require- 
Ments for the PhD degree. The author 
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being masked by other factors such as 
drive-stimulus generalization decre- 
ments. Thus, if a procedure were 
used where the possibility of such 
decrements was minimized, it might 
be possible to measure the pure ener- 
gizing effects of the combination of 
hunger and thirst if such exist. If 
appetitional needs combine with aver- 
sive needs as Amsel's and Meryman's 
findings suggest, it seems feasible to 
use the typical paradigm of acquired 
drive studies for this purpose. The 
present study attempted to measure 
the effects of food deprivation, water 
deprivation, and combinations of food 
and water deprivation on the per- 
formance of a response motivated by 
acquired fear. On the basis of Hull's 
(1943) drive-summation hypothesis it 
was predicted that a fear-motivated 
response would be augmented by the 
addition of an irrelevant appetitional 
need or combination of irrelevant 
needs and that the level of augmenta- 
tion would be a direct function of the 
level of deprivation. 


METHOD 


Subjecs.—The Ss consisted of 84 naive, 
male, hooded rats from the colony maintained 
at the Psychology Department of Syracuse 
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University, 91-128 days of age on their first 
experimental day; average age was 108 days. 

A pparatus.—F ear conditioning was per- 
formed in two separate white, wooden framed 
shock boxes, the interior dimensions of which 
were 93 in. long X 2} in. wide X 5 in. high. 
The floor of each box consisted of 21 #5-in. 
brass welding rods spaced 7g in. apart through 
which a 115-v. electric current (UCS) could be 
delivered at the rate of two impulses per grid 
per sec. A 100,000-ohm resistor was in series 
with each S. Hinged to the rear wall of each 
shock box was a 12-in. high box, the hardware 
cloth bottom of which served as the ceiling 
of the shock box. Mounted ł in. above the 
hardward cloth bottom was a pane of opal 
glass which provided diffuse transmission of 
light from the tunnel to the interior of the 
shock box. Concealed within the tunnel were 
the CS, a 40-w. incandescent lamp situated 
8 in. above the opal glass, and the intertrial 
light, a 73-w. incandescent lamp situated 10 
in. above the opal glass. The illumination 
provided by the CS was approximately 120 
footcandles (ftc.) and the intertrial illumina- 
tion, 73 ftc. 

The hurdle-jumping apparatus consisted of 
two compartments separated by a 4-in. 
Partition containing a guillotine door 2} in. 
wide X 3 in. high which rested on a 2-in. high 
hurdle. The start box was built to be 
identical to the shock boxes, whereas, the safe 
box was gray and had a plywood floor. Rais- 
ing of the manually operated guillotine door, 
closed a switch which resulted in the simul- 
taneous activation of the CS and Hunter 
klockounter. A second switch was located 
beneath the safe-box floor. The weight of S 
on the floor closed the switch which simultane- 
ously turned off the CS and stopped the 
klockounter. 

Design.—The overall design of the study 
involved the fear conditioning of rats through 
the pairing of an increase in illumination (CS) 
with a 115-v. shock (UCS) delivered to Ss by 
means of a grid floor. Immediately following 
35 fear-conditioning trials, conducted while 
Ss were maintained on an ad-lib, feeding 
schedule, all 5s were put on a 23-hr. limited 
time food- and water-deprivation schedule. 
Seven days later each S was tested in the 
hurdle-jumping apparatus, 25 trials per day, 
for 2 days under different levels of hunger 

and thirst. Response latencies for each S on 

h trial were converted to reciprocals, and 
each iprocals for each S were computed in 
Te a s rial thus providing for five 
blocks 7 § téials the first day of testing and 
ndis of 5 trials the second day of 


testing. 
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Procedure.— he 84 Ss used in the experi- 
ment were introduced to the experimental 
setting in squads of 12, 2 Ss of each squad 
randomly assigned to one of the six experi- 
mental groups. During the 11 days required 
for the administration of the experimental 
conditions, Ss were maintained in individual 
living cages. 

On Days 1 and 2 Ss received 73 min. of 
handling and 20 min. of exploration of the 
hurdle-jump apparatus with intertrial light 
on; 10 min. in the start box and 10 min. 
in the safe box. . 

On Day 3 each pair of Ss was administered 
35 fear-conditioning trials. Following random 
gnment to the pair of shock boxes, the 
first trial began 1 min. after Ss were placed in 
the boxes. Each trial began with the onset 
of the CS, 4 sec. after which the UCS was 
presented for a 2-sec. duration with CS and 
UCS terminating simultaneously. Intertrial 
interval was 2 min. Following the last fear- 
conditioning trial for the last pair of Ss in 
each squad, food and water were removed 
from the home cages of cach S in that squad. 
For the ensuing 5 days each S in the squad 
was maintained on a limited time feeding 
schedule with free access to food and water for 
a daily 1-hr. period. . 

On Day 9 (the sixth day of deprivation) 
the 1-hr. feeding period was adjusted so that 
it began exactly 24 hr. before the scheduled 
test trials of the next day, Since test trials 
were scheduled at 1-hr, intervals, the feeding 
schedule of Day 9 was similarly staggered at 
1-hr. intervals coinciding for each group with 
their scheduled testing period. ‘This resulted 
in slight variations from the regular feeding 
time ranging from 1 to 3 hr. 

To provide for the designated combinations 
of levels of food and water deprivation at the 
onset of testing, the various treatment groups 
received food and water for 1-hr. periods on 
Day 10 in the following manner: Group. c 
(23H-23T), food and water the hour following 
test trials; Group F (OH-OT), food and water 
the hour preceding test trials; Group 
(23H-OT), water the hour preceding test 
trials and food the hour following test trials; 
Group E (OH-23T), food the hour preceding 
test trials and water the hour following test 
trials; Group B (23H-12T), water for a 1-hr. 
Period 12 hr, before testing and food the hour 
following testing; Group D (12H-23T), food 
or a l-hr. period 12 hr. before testing and 
water the hour following testing. 

Each of the 25 test trials of Day 10 
consisted of placing the first S of the pair in 
the start box of the hurdle-jumping apparatus, 
10 sec. after which E raised the guillotine door 
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Mic. t, Hurdle-jumping performance 
iiie simultaneously activated the CS and 
durin ater: Fhe l cS was not presented 
hurdle trials, When S jumped the 
the sabe a depression of the floor switch of 
the PEA terminated the CS and stopped 
GE es s Router, thus providing a measure 
Were Ea latency. | The guillotine door was 
box, wl toitowing S's entrance into the safe 
he paa here S remained for 10 scc., after which 
while yada and placed in a holding cage 
ion the second Sof the pair was subjected 
Tun ne hurdle-jump conditions. If S did not 
st P within 60 sec., 5 was removed from the 

art box, placed in the holding cage, and a 
latency of 60 sec. was recorded. 

On Day 11, the final experimental day, 
Sach S was given 25 test trials identical in 
Procedure to those administered on Day 10, 
except that the test period for each pair of Ss 
Was 1 hr, later than on the previous day. 


RESULTS 


The data of the experiment are 
Presented as two separate but parallel 
Studies. One study includes the data 
of four groups, three of which were 
tested under 23 hr. of water depriva- 


as a function of the addition of hunger to thirst. 


tion but under different levels of food 
deprivation; Groups E (0H-23T), 
D (12H-23T), and C (23H-23T); and 
one of which was tested under 0 hr. of 
food and water deprivation, Group 
F (0H-0T). The other study also in- 
cludes the data of four groups, three of 
which were tested under 23 hr. of food 
deprivation but under different levels 
of water deprivation; Groups A 
(23H-0T), B (23H-12T), and C 
(23H-23T) ; and Group F (0H-OT). 
This mode of presentation was 
adopted because it most clearly in- 
dicates the effect on performance of 
increasing hunger while thirst is held 
constant in the one case and of in- 
creasing thirst while hunger is in- 
creased in the other. 

The performance medians of the 
23-hr. water-deprived groups plus the 
0H-OT group (Groups E, D, C, and F) 
on Day 2 and Trial Block 10 are given 
in Fig. 1. These two periods were 


586 


selected because it was felt that they 
would be most sensitive to the effects 
of the treatments. Since acquisition 
of the hurdle-jump response must be 
established before the effects of the 
irrelevant drives can be evidenced in 
the speed of hurdle jumping, the early 
trials of Day 1, wherein the hurdle- 
jump response is less likely to be well 
established, will not as clearly reflect 
the effects of the addition of the 
irrelevant drives. The obtained rank 
order of performance is exactly that 
which was predicted, i.e., the level of 
augmentation of the fear-motivated 
response is a direct function of the 
level of deprivation. Since the pre- 
dictions were made with respect to the 
relative order of treatment effects, a 
Jonckheere (1954) "'distribution-free 
k-sample test against ordered alterna- 
tives" was performed on the perform- 
ance measures of Day 2 and Trial 
Block 10. In both cases, the obtained 
order of performance was consistent 
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with the predicted order. The data 
of Day 2 gave an S of 364 which 
when converted, resulted in a ¢ (134) 
of 2.577, p < .025. The data of Trial 
Block 10 gave an S of 400 which 
when converted, resulted in a ¢ (134) 
of 2.793, p < .005. 

The performance medians of the 
23-hr. food-deprived groups (Groups 
A, B, and C) plus Group F (0H-0T) 
on Day 2 and Trial Block 10 are given 
in Fig. 2. The obtained rank order of 
performance was again exactly in 
keeping with the predicted order. A 
Jonckheere test of the data of Day 2 
resulted in an S of 302, ¢ (134) 
= 2.167, p < .025; Trial Block 10 
resulted in an S of 298, ¢ (134) 
= 2.138, p < .025, thus indicating 
that the obtained order of perform- 
ance was consistent with the predicted 
order. : 

In Fig. 3 the means of the recip- 
rocals of latency of hurdle jumping 1" 
blocks of 5 trials are plotted for the 


DAY 2 
TRIAL BLOCK 10 


B(23H-12T)  C(23H-23T) 
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Hurdle-jumping performance as a function of the addition of thirst to hunger. 
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Hurdle-jumping performance as a function of trial blocks for groups 


in which hunger has been added to thirst. 


me in which hunger was added to 
suirst (Groups E, D, C, and F). The 
amm o of a trend analysis of vari- 
De, over the 10 blocks of trials 
Pig i se Conditions X Trials) is 
cane in Table 1. On the basis of the 
that ae of Fig. 3 it would appear 
ns significant Trial factor re- 
Perfo rom an overall improvement 1n 
trial rmance over the 10 blocks of 
e $, thus providing evidence that 
ning occurred. Although the main 
s Privation effect did not result in a 
Snificant mean square, an analysis of 
Depp us Between Groups effect was 
an d e on the means of Day 2 
- Trial Block 10, since the inter- 
ion mean square was significant. 
si Sither of these analyses resulted in 
I£nificance, 
e Fig. 4 the means of _the recip- 
i of latency of hurdle jumping in 
r cks of five trials are plotted for the 
^n in which thirst is added to 
ec (Groups A, B, C, and F). A 
ap pary of the trend analysis of 
Tiance for these data is given In 


Table 2. It would appear that the 
significant Trials factor resulted from 
an overall improvement in perform- 
ance over trials, thus indicating that 
learning occurred. 


DISCUSSION 


The augmentation of a fear-motivated 
response through the addition of irrele- 


TABLE 1 
"TREND ANALYSIS OF VARIANCE OF PER- 
FORMANCE OF GROUPS F (OH-OT), 
E (OH-23T), D (12H-23T), AND 
C (23H-23T), OVER THE 10 
Brocks or FIVE TRIALS 


Source df MS F 
Between .Ss 55 
Deprivation (D) 3 | 4.8652 | 2.0604 
Error (b) 52 | 2.3612 
Within Ss 504 
Trials (T) 9 | 1.2734 |13.1686** 
TxD 27| .1836 | 1.8986** 
Error (w) 468 | .0967 
Total 559 
**$ <.01. 
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vant appetitional needs was substanti- 
ated by the data. The results of the 
Jonckheere tests clearly upheld the pre- 
dictions generated by Hull's drive-sum- 
mation hypothesis, that the level of 
augmentation of the fear-motivated re- 
Sponse would be a direct function of the 
level of deprivation. Within the frame- 
work of Hull's theory, it would appear 


TABLE 2 
TREND ANALYSIS op VARIANCE OF Prr- 
FORMANCE OF Groups A (23H-0T), 
B (23H-12T), C (23H-23T), and 
F (0H-0T) oven THE 10 
BLOCKS or Fry; 


TRIALS 
Source df | Ms F 

tween Ss 55 

poo o (D)| 3 | 3.3543 | 1.2547 
Error (b) 52 | 2.6733 
pene 504 

W ithin Sa 9 | 1.1829 |11.0551** 
TE D 27 | .1522 | 1.4224 
Error (w) pi SEM 

Total "à 
**$ <.01. 


Hurdle-jumping performanc 


€ as a functio 
in which thirst has been 


n of trial blocks for groups 
added to hunger. 


that the addition of various levels and 
combinations of levels of irrelevant needs 
Summated with the relevant fear drive, 
thereby increasing D and resulting 1n 
heightened performance as evidenced in 
shorter response latencies, " 
In keeping with the collateral studies 
of Amsel (1950) and Meryman (1952), 
the fear-motivated response of the pres- 
ent study was augmented by the addition 
of irrelevant appetitional needs. Unlike 
Amsel's Study, however, wherein the 
latency of response of the 22-hr. food- 
deprived Ss was shorter than that of the 
satiated group, the present study showed 
that the performance of the 23-hr. de- 
Prived groups (A and E) was not signifi- 
cantly different from the satiated group 
(F). This difference between Amsel's 
findings and the findings of the present 
study is probably the result of a differ- 
ence in procedure. Amsel shocked his Ss 
while they were experiencing deprivation, 
whereas in the present study, Ss were 
shocked previous to undergoing depriva- 
tion. Further, the test day on which 
Amsel found the significant difference 
between groups was the day following 
shock, whereas in the present study, the 
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crucial test trials did not occur until 8 
days after shock. It may be that the 
heightened performance of Amsel’s de- 
prived groups is attributable to an in- 
both H and D. Since Ss of 
Amsel's study were deprived during both 
fear conditioning and testing, the total 
stimulus complex experienced by his Ss 
would be more similar from training to 
testing than would that experienced by 
Ss of the present study. Recency would 
also favor Amsel's procedures, since the 
considerably shorter delay between train- 
ing and testing would not provide as 
much opportunity for the development 
of competing responses. Considering 
these differences in procedure it would 
seem that those followed by Amsel are 
more likely to show the effect of the addi- 
tion of one irrelevant need. 

With respect to the most severely de- 
Prived group, the findings of the pres- 
ent study are exactly opposite those of 
Kendler (1945). Since Kendler's study 
involved the use of an instrumental re- 
sponse reinforced solely by food, it may 
be that his failure to find a motivational 
effect for his most severely deprived 
group (22 hr. food deprivation plus 22 hr. 
Water deprivation) resulted from a ditfer- 
ence in the effectiveness of the reinforcing 
agent during acquisition, i.e., the incen- 
tive value of the food reinforcement may 
have been greater for the less severely 
Water-deprived Ss than for the 22-hr. 
Water-deprived group, thus offsetting the 
energizing effects of deprivation. This 
interpretation is consistent with the data 
of Bolles (1961) which suggest that when 
Water is restricted, food intake is reduced. 
Since it is unlikely that hunger and thirst 
are independent, any study designed to 
Measure the effects of combinations of 
appetitional needs may be confounded if 
the dependent variable is either a con- 
Summatory response or an instrumental 
response learned on the basis of thirst or 
lunger, 

Accounting for the results of the pres- 
ent study from Estes’ theoretical position 
Presents certain problems. Since the 


crease in 
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fear response elicited by the CS was 
learned before Ss were subjected to dep- 
rivation, i.c. under conditions of satia- 
tion, it would appear that the stimulus 
complex giving rise to fear during test 
trials should be less similar for the more 
severely deprived Ss. Thus, the more 
severely deprived Ss should be less fear- 
ful. If fear is the primary motivational 
condition for the hurdle-jump escape 
response, it would seem that Estes’ 
theory would predict that performance 
should be in an order the opposite from 
that obtained in the present study. It 
might be argued, however, that since 
hurdle jumping is an instrumental re- 
sponse, the additional drive stimuli (S4) 
provided by the conditions of deprivation 
supplement the population of stimuli 
which can be sampled during the course 
of acquisition of the hurdle jump, thus 
allowing for predictions consonant with 
the data of the present study. 
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RECOGNITION MEMORY FOR RANDOM SHAPES AD ok 
FUNCTION OF COMPLEXITY, ASSOCLATION 
VALUE, AND DELAY?! 
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12 groups of 24 Ss each were examined on 2 tests at 3 delays for recogni- 


tion accuracy of random shapes h 


levels of association value (A). 
taken. AY 
low A, and a 2nd recognition test o 


significant. 


Forms of high A were mor 


Complex form coding was Positively related to A. 
form coding was infrequent and not related to A. 
related to recognition accuracy for complex forms only. 


aving 2 levels of complexity and 2 
A measure of form coding was also 
accurately recognized than forms of 


n the same forms resulted in a practice 
effect for simple but not for complex forms. 


All delay effects were in- 
Simple 
Coding was positively 


were: (a) Ss probably stored an uncoded image of the entire shape of 


simple, but not of complex forms 


remembered by making associations to them. 


The purpose of this experiment was 
to determine the effects of form com- 
plexity (Attneave, 1957), form asso- 
ciation value (A) (Vanderplas & 
Garvin, 19593), and delay between 
initial and subsequent presentation of 
a form on recognition accuracy. Four 
experimental outcomes, based pri- 
marily on conclusions arrived at by 
Deese (1956), were predicted. 


1. With increases in delay time, 
recognition accuracy was expected to 
decrease more for complex forms than 
for simple forms, because complex 
forms contain more information than 
simple forms. 

2. With equally low A's for complex 
and simple forms, it was expected 
that, at the shortest delay, complex 
forms would be more accurately 
recognized than simple forms. Higher 
accuracy was predicted for complex 
forms because they are presumably 

dissertation submitted in 
aeta fléllment of the requirements for a 


in psychology at the University 
Eee wie nels was under the direction 
o i 


E ium po A Medical Research 


Laboratories, Wright-Patterson Air Force 
a » 
Base, Ohio. 


more discriminable than simple forms. 

3. With equally high A's for com- 
plex and simple forms, at the shortest 
delay, the two types of forms were 
expected to be identified equally well. 
Deese (1956) suggests that “If the 
Observer is able to code forms with 
previously stored information, the 
information conveyed in the forms 
may make little difference in a test of 
recognition [p. 37." 

4. Forms of high A were expected 
to be more accurately recognized than 
forms of low A. Evidence for the 
incremental effect of association value 
on memory has been supplied by 
Vanderplas and Garvin (1959b), Pet- 
erson, Peterson, and Miller (1961), 
Murdock (1961), Bousfield and Cowan 
(1962), and Hilgard (1962). 


METHOD 


Subjects —One hundred and fifty-four 
female and 134 male students selected from 
the University of Illinois elementary psy- 
chology S pool served in this experiment as a 
Course requirement, In all but one group, the 
Sexes were divided about equally. 

Materials. —A Kodak Carousel slide pro- 
jector with a shutter which controlled expos- 
ure duration projected the stimuli onto a 


590 


FACTORS IN RECOGNITION MEMORY FOR SHAPES 


591 


TABLE 1 


Test STIMULI 


High Medium Low 
Supe forms — | —€—— E88 
6 point Siini | $1516 18 2 | 131920 22: 30 
Complex forms ud " "m 
Ape BIERO SETA | 10,18; 167 22, 28 


3, 8, 11, i2 


3 X 4ft. white screen. Image size was such 
that each could be contained within a circle 
having a 1-ft. diameter. The projector illumi- 
nation selector was set at the 500-w. position. 
Sixty randomly constructed shapes mounted 
on 2 X 2 in. slides were the stimuli. These 
Shapes had 4, 6, 16, or 24 points, and were 
selected from a group of 180 uncurved shapes 
Constructed by Vanderplas and Garvin 
(19593). Four- and 6-point shapes are re- 
ferred to he imple forms, while 16- and 
ay eine shapes are called complex forms. 
ie forms selected are listed in Table 1 by 

eir Vanderplas and Garvin identification 
number, 

Phere were 30 complex and 30 simple 
rei Within cach category there were 10 
ques each of high, medium, and low A. The 

measures were taken from Vanderplas and 
or 60.4 ( 19592), and represent the percentage 
Le s who gave an associative response to a 

Sec. visual presentation of the form. 

e attempt was made to match the 
ee the 4-point forms with the 6-point 
B IS the 16-point forms with the 24- 
ee orms. An effort was also made to 
Match the means and variances of simple and 
Complex forms of high, medium, and low A, 
respectively, The obtained values are 
Presented in Table 2. 

Procedure.—Twelve groups of 24 Ss each 

Were examined for recognition accuracy In an 
analysis of variance design which included 
hree levels of delay (5, 10, and 20 min.), two 
*vels of form complexity (simple-complex), 
and two levels of A (high-low). The Ss 
Within each group were administered two 
recognition tests. : 

Each group was tested separately during 
3pproximately a 1-hr. session. The Ss sat 
a classroom 10-20 ft. from the screen. All 
Sessions had two major phases. During 
"hase I, Ss viewed 10 forms—each form 
Presented once—read a newspaper during a 
clay period, and then attempted to identify 

€ 10 learned forms in a forced-choice 


recognition test in which the 10 learned forms 
were paired randomly with 10 new forms. 
A questionnaire on the newspaper content 
was administered following the recognition 
test. Phase lI was a replication of Phase I 
with three exceptions: (a) The same 10 forms 
were presented, but in a different random 
order. (b) A different newspaper and ques- 
tionnaire were used. (c) The learned and un- 
learned forms were paired according to a 
different random order. 

Prior to Phase I, all Ss participated in a 
brief practice session which acquainted them 
with the experimental task. The practice 
slides were of CVC trigrams. A reward 
of $2.00 was given to the one S within 
each group who scored highest on the two 
questionnaires. 

The 10 slides to be learned were presented 
one ata time, cach for 0.5 sec., with a 3.5-sec. 
interval between slides. The Ss were in- 
structed to look at the forms and remember 
them. The order of slide presentation was 
random with the restriction that 4- and 
6-point forms were presented alternately as 
were 16- and 24-point forms. The same 
random order was used for all groups to hold 


TABLE 2 


MEAN ASSOCIATION VALUES AND VARIANCES 
FOR SIMPLE AND COMPLEX FORMS 
or HicH, MEDIUM, AND Low 
ASSOCIATION VALUE 


Simple^ Complex? 
M s: M St 
High 45.20 | 27.36 | 45.20 | 24.96 
Medium | 39.20 | 10.56 | 38.80 | 7.36 
Low 35.20 | 19.36 | 35.20 | 20.16 


* 4 and 6 points. 
b 16 and 24 points. 
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TABLE 3 
ANALYSIS OF VARIANCE OF 
RECOGNITION SCORES 
= = = = == 
Source df MS F 
een Ss 287 2.94 
Fe ea value (A) 1 | 105.08 40.09** 
Complexity (C) d E 
Delay (D) 2 032 
AXC | -O6 
AXD 2| 0.19 
cue 2| £n 
e s : N 
E (between) 276 2.62 
/ithin Ss 288 1.39 . 
En T va) Phase II (P) 1 11.10 | 8.47** 
P XA 1 2.01 
PxC 1 11.01 | 8,4055 
PXD 2 1.05 
PXAXC 1 0.07 
PXAXD 2 1.65 
PG xD 2 1.65 
PXAXCXD 2 3.83 
Error (within) 276 1.31 
Total 575 2.16 
** 5 «.01 


constant the interaction between A and serial 
position. 

On each of the 10 recognition test trials, .S 
saw two forms, oneata time, each for 0.5 s Es 
with a 3.5-sec, interval between them. Dur. 
ing a 7.5-sec, interval between pairs, S 
designated by writing the number “1” or 29 
whether the first or Second form presented 
was the learned form. On half of the 10 trials 
the first form was a learned form, and on the 
other half, it was an unlearned form of the 
same complexity level but of medium A, 


occurrence were not the same for the Phase | 
and Phase II recognition tests, 

A questionnaire administered at the end 
of Phase IT inquired of the extent to which Ss 
coded the forms as an aid in remembering 
them. The question asked was: “Did you 
name or code the forms, in any way, in order 
to help you remember them? „For example, 
some people have used names like ‘stairway,’ 
‘bird flying,’ or ‘witch s hat to identify 

articular forms." Four alternative answers 
ien provided, and .S circled one. The 
TEETE ANES were: (A) always, (B) frequently, 
(C) seldom, (D) never. The question was 
taken from Deese (1956, p. 14), and was in- 
cluded to determine the relationships between 
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recognition accuracy and coding. Positive 


relationships were expected. 


RESULTS 


Analysis of recognition scores.—lhe 
maximum score for the recognition 
test of each phase was 10. A response 
was judged correct and scored 1 if the 
learned form was correctly identified. 
The total number of correct responses 
each S made was counted and the 
sums used as scoresina2 X2 X 3 X 2 
analysis of variance. The between- 
sroup variables were complexity, A, 
and delay. The within-group variable 
was Phase | (vs.) Phase I1 recognition 
accuracy. Table 3 summarizes the 
analysis. 

Listed in Table 4 are the Phase I 
and Phase II mean recognition scores 
for all 12 experimental groups. Also 
shown for each group are the means 
and variances of the combined Phase 
I and Phase II scores, Bartlett's 
(Edwards, 1960) test for the signifi- 
cance of differences among the vari- 
ances was not significant, Groups 1-6 
viewed complex forms, and Groups 
7-12 viewed simple forms. The 
letters H and L in Table 4 indicate 


TABLE 4 


MEAN CogRECT RECOGNITION Sconis 
AND VARIANCES 


Group MI MI |MIII| S? III 
Complex 
1. H5 | 820 | 829 | 1650 |. 4.69 
2. H 10| 8.66 | 841 17.08 | 3.03 
3. H20| 8.08 | 858 | 1666 | 414 
4.L 5 | 745 | 758 15.04 | 10.56 
3. L 10| 729 7.50 | 14.79 5.21 
6. L 20| 7.54 | 687 14.41 | 4.42 
Simple 
7. H5 | 829 | &s4 16.83 | 7.44 
8. H10| 745 | 875 | 1620 3.47 
9. H 20! 8.04 | &54 | 16.58 5.12 
10.L 5 | 733 | 7:70 15.04 | 5.86 
H.L 10| 712 | 7:66 14.79 | 5.12 
12. L 20| 758 | 795 15.54 | 4.08 


FACTORS IN RECOGNITION MEMORY FOR 


TABLE. § 


CODING AND 
OR SIMPLE 


RELATIONSHIP BETWE 
Association VALU 


Simple Forms | Complex Forms 


| SH | SL | CH | CL 
“Always” and | | | 
"Frequently" 21 | 19 | 48 26 
“Seldom” and 
"Never" 51 | 53 24 46 


Whether a group was tested on forms 
of high or low A. The numbers 5, 10, 
and 20 denote delay times. 

Che results of the analysis of vari- 
ance indicate that: (a) All delay 
effects were insignificant. (b) There 
was no evidence that complex forms 
Were more discriminable than simple 
forms, At the 5-min. delay, complex 
and simple forms of low A were 
identified equally well. (c) At the 
?-min. delay, complex and simple 
orms of high A were identified 
equally well. Since the effects of 
Complexity were insignificant under 
oth the high and low A conditions, 
A did not wash out the effect of com- 
Dlexity only under the high A condi- 
ne as was predicted. (d) Within 
th complexity categories, forms of 
Ugh A were recognized more accu- 
rately than forms of low A, thus 
Confirming Prediction 4. A practice 
effect is indicated by the significant 
Main effect of Phase I (vs.) Phase l 
ecognition accuracy. Investigation 

the significant P XC interaction 
attributed the effect to simple forms 
Only, 

Analysis of coding scores.—The 
"lationship between coding and A is 
depicted in Table 5. In order to 
Siminate cells with low frequencies, 
a€ coding alternatives "always" and 

requently” were combined. For the 
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same reason, Ss were combined over 
delays, resulting in a four-category S 
classification: complex high (CH), 
complex low (CL), simple high (SH), 
and simple low (SL). 

The chi-square value for a test of 
independence between simple form 
coding and A was 0.03 (df = 1), in- 
dicating no relationship between cod- 
ing and the Vanderplas and Garvin 
association values for simple forms. 
A one-sample (Siegel, 1956) chi- 
square test showed that simple forms 
were coded “seldom” and “never” to 
a significantly greater degree than 
they were coded “always” and “fre- 
quently”; x? (1) = 28.44, p < .001. 
There was a positive association be- 
tween complex form coding and A; 
x? (1) = 12.26, p < .001. 

The relationship between coding 
and recognition accuracy was meas- 
ured by correlating Ss’ coding scores 
with their Phase 1 plus Phase ll 
recognition scores. The four coding 
alternatives were scored A = 4, B = 3, 
C 22, and D =1. The obtained 
Pearson product-moment correlation 
coefficients are presented in Table 6, 
along with mean recognition scores, 
mean coding scores, and standard 
scores which denote the significance 
of the correlations. These statistics 
were computed after combining .Ss 
over delays, and are therefore based 


on the performance of 72 Ss. The z 


TABLE 6 


MzaN RECOGNITION AND CODING SCORES, 
AND CORRELATIONS BETWEEN Ss’ 
CopING AND RECOGNITION 


SconEs 
Mea Mea S 
Groups | recosaition | Coding 5 3 
CH 16.75 2.69 0.49 | 4.45** 
GL 14.75 2.06 0.31 | 2.81** 
SH 16.54 1.91 0.22 2.00 
SL 15.12 1.76 0.20 1.81 
i **p <01. 
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scores are transformations ot the 
correlation coefficients, and indicate 
the significance of the correlations 
when referred to the standard normal 
distribution (McNemar, 1962). Val- 
ues over 2.58 were accepted as signifi- 
cant. The relationship between cod- 
ing and recognition accuracy was 
significant for complex forms, but not 
for simple forms. 


DiscussioN 


Recognition for complex forms was 
positively related to A and coding, and 
the latter two variables were positively 
related to each other. Recognition for 
simple forms was positively related to A, 
but A was not significantly related to 
coding, nor was coding significantly 
related to recognition accuracy. Ignor- 
ing the coding data, it appears that Ss 
were able to make associations to most 
forms of high A, and were consequently 
able to recognize these forms more 
accurately than forms of low A. 

The relationship between coding and 
A for complex forms suggests that our 
coding measures and the Vanderplas and 
Garvin A measures are very similar, 
The same relationship obtained for 
simple forms Suggests that the two meas- 
Other evidence 
which suggests that our coding data and 
the Vanderplas and Garvin measures are 
not the same is the finding that form 


If our coding measure is reliably 
different from the A measure, it may 
reflect different processes underlying the 
perception of simple and complex forms, 
If such is the case, the two Processes are 
probably not equally efficient, A t test 
on the difference between the Phase I 
simple and complex recognition means 
was insignificant, but a corresponding 
test on the Phase II means gave a ¢ of 
2.370 (p < .01). The memory traces 
established for the simple forms seen 
during Phase I might have been more 
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stable than those established for the 


complex forms, resulting in the practice 
effect. Alternatively, learning may have 
been equal, but the retroactive effect 


of the Phase I recognition test may have 
been greater for complex than for simple 
forms. . 

Postexperimental conversations with 
‘Ss suggested one way in which the 
perception of 4- and 6-point forms may 
differ from the perception of 16- and 24- 
point forms. The Ss were well aware of 
their making associations to complex 
forms, and found such coding necessary 
for accurate recognition. Simple forms 
were difficult to code, however, and Ss 
claimed they relied on remembering an 
uncoded image of the entire contour of 
the form. Perhaps the contours of simple 
forms can be stored, while the contours of 
complex forms cannot be stored. The 
Practice effect for simple forms, then. 
may be related to this supposition. It 
may be easier to build upon a stored 
image of an entire form rather than on 
one of several possible associations to à 
form. : 

The obtained difference in recognition 
accuracy between simple forms of high 
and low A remains Unexplained. Perhaps 
the simple forms of high A had character- 
istics which allowed images of their 
contours to be remembered more effi- 
ciently than the Contours of simple forms 
of low A. 

Finally, the absence of any delay 
effects suggests that the burden placed 
on the memory process was well within 
Ss’ short-term memory, or that the 
recognition task employed was too gross 
to detect any memory changes. Had a 
recall measure of remembering been em- 
ployed, or had longer or shorter delays 
been sampled, delay effects might have 


been found. 
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AINED CLASSIFICATION ! 
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Johns Hopkins Un iversity 


2 experiments on clas 


and Orientation of the dots. 
presented to S, who dichotomoi 


In 


for attributes as well as by discrin 
discriminability of compe 
S is required to sort stimulus sets 
E, and speed of sorting is measur: 
of the criterion attribute 


In a classification experiment, stim- 
uli are generated by combining the 
different possible levels of twoorm 
attributes to provide 
variate stimuli. The task of S in such 
experiments is to place each stimulus 
from the total set into one of a number 
of different classes, either by a 
differential response or by actually 
placing stimuli in different groups. 
Whatever the particular experimental 
procedure, S must demonstrate both 
differentiation and generalization— 
differentiation because he must use a 
different label or group for each cl 
of stimuli, and generalization bec. 
he must use the same | 
for all members of the same class, 

The purpose of the present experi- 
ments is to investigate some of the 
factors which affect classification per- 
formance, in particular those which 
relate to differences in the attributes 
which define the stimuli: 

Preference for altributes.— While 
there are many ways of forming 


1This research was done in Part under 
Contract Nonr-248(55) and in Part under 
Contract Nonr-4010(03) between the Office 
of Naval Research and the Johns Hopkins 
University. This is Report No. 6 under the 
latter contract. Reproduction in whole or in 
art is permitted for any purpose of the 
Tinte States Government. 


ore 
à set of multi- 


ass 
ause 
abel or group 


fication we: 
of a pair of dots which could vary 


usly classifies 
he chooses. In this task choices are affected 


ting attributes, 


according to the 
ed. 
affects perfo 


re carried out with stimuli consisting 
in Position, Distance between dots, 
free classification, stimulus sets are 
according to any attribute 
l by individual preferences 
of the attribute chosen and 
In constrained cla: ation, 
attribute specified by 
In this task, only discriminability 
rmance, 


ninability c 


classes of stimuli from the total sct, 
the simplest and by far easiest for S to 
use (Shepard, Hovland, & Jenkins, 
1961) is for each level of one attribute 
to define one class, With classes 
formed from a single attribute, the 
task of S is to use that one (relevant) 
attribute for differentiation, and to 
ignore differences jn all other (ir- 
relevant) attributes, ie. he must 
generalize across different values of 
the nondifferentiating attributes. 

It seems reasonable that some at- 
tributes should lend themselves more 
naturally to such differentiation than 
other attributes, perhaps because they 
"stand out" more, are more “salient,” 
or are just preferred by S for any of 
a number of possible reasons. As 
reasonable as it may seem, however, 
the determination of these properties 
of attributes in classification tasks is 
difficult because of their interaction 
with other Properties such as dis- 


criminability, Archer, Bourne, and 
Brown (1955), for example, suggest 
that 


any differences between attri- 
butes in ease of concept learning are 
Probably due to differences in dis- 
criminability, specifically referring to 
such differences as found by Heid- 


breder (1946) and Baum (1954). 
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(See Hunt, 1962, pp. 55 f., for a fuller 
discussion of this problem.) 
Discriminability of an altribute.— 
In order for a particular attribute to 
provide the basis for differentiation 
of classes, its two or more levels must 
be discriminable from each other. 
Furthermore, the different levels on 
one attribute may be more discrimi- 
nable than the levels on another 
attribute. It does not seen reason- 
able to ascribe toan attributea special 
property such as salience, dominance, 
or even just preference, unless we can 
be assured that the property is not 
Simply the result of the discrimina- 
bility of the attribute. 
M This problem is clear in its extreme. 
€ would not say an attribute is not 
preferred, or is ineffective, if its two 
levels cannot be discriminated. In 
Such a case, the attribute is simply 
not functional. But if two attributes 
are compared for effectiveness, and 
isi fhem has levels which are much 
b iserininable than the other, we 
iy still be hesitant to say that the 
org discriminable attribute was the 
Sade or more effective attribute, 
"s cce conclusion could easily 
sa Pow if different levels were 
cannot : ni — i i ER 
ef g rea ly investigate the problem 
Ne special role of attributes in 
Classification without also investigat- 
"Dg the role of discriminability of 
attributes. 
. Types of classification task—Typ- 
‘cally, classification tasks have re- 
{Wired S to learn a classification rule 
Specified by Æ, through a series of 
*Xamples, In a sense, however, 
Classification tasks can be considered 
fe be on a continuum in which the 
eedom allowed Sin his task is varied. 
.t one end of this continuum we 
Would simply give S the entire set of 
Stimuli and ask him to form classes. 
n such a task, which we shall call 
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free classification, the experimental 
result is the determination of the 
actual mode of classification used by S. 

At the other end of this continuum, 
the task is completely specified by Æ 
and is told to SS, so that S has no 
choice, even initially, about how he is 
to form his classes. In such a task, 
which we shall call constrained classi- 
fication, we cannot measure how .S 
forms his classes, but must somehow 
measure the efficiency with which .S 
can use the different rules specified 
by E. 

It seems quite possible that the role 
of a particular attribute in classifica- 
tion tasks will itself depend on the 
nature of the task. Such factors as 
preference for an attribute might 
be operative with free classification, 
where some choice exists for S, but 
might be of no consequence where S 
has no choice about his classifications. 

Thus these experiments are de- 
signed to show the interrelations be- 
tween preference for attributes, dis- 
criminability of attributes, and the 
amount of freedom allowed S in his 
classification task. 


Tur STIMULI 


Each stimulus consisted of a white 
card, 10 cm. square, on which was placed 
a pair of dots, each dot having a diameter 
of 2 mm. Different sets of these stimuli 
were generated, and in each set the 
stimuli could differ from each other with 
respect to three dichotomous attributes 
of the pair of dots. In addition, the two 
levels of each attribute differed in 
relative discriminability, as determined 
by the amount of physical difference be- 
tween the two levels of the attribute. 


Stimulus Characteristics 


A stimulus characteristic consisted 
both of a particular attribute of the 
stimuli and also of its relative discrimina- 
bility. Each of the three attributes was 
used with four different relative discrimi- 
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nabilities, so that there are altogether 
12 different stimulus characteristics. 

Position of dots (P).—One attribute 
involved the lateral position of the pair 
of dots, each pair always being centered 
with respect to vertical location. When 
position was not functional as an attri- 
bute for a particular set of cards, the pair 
of dots was also centered horizontally. 
When it was functional as an attribute, 
then the pair of dots would be located to 
the right of center for half the stimuli and 
to the left of center an equal amount for 
the other half of the stimuli. For a 
relative discriminability of P1, the mid- 
point of the pair of dots was either 2 mm. 
to the right or to the left of center; for 
P2, 4 mm.; for P3, 8 mm.; and for 
relative discriminability of P4, 16 mm. 
to the right or left of center. 

Distance between dots (D).—The second 
attribute involved the distance between 
the pair of dots. When distance was not 
functional as an attribute, the two dots 
were 10 mm. apart at their centers. For 
the four relative discriminabilities, the 
two distances were, respectively: D1, 
9.5 or 10.5 mm.; D2, 9.0 or 11.0 mm.; 
8.0 or 12.0 mm. ; and D4, 6.0 or 14.0 mm. 

Orientation of dots (O).—The third 
attribute was the angular orientation of 
the dots. When orientation was not 
functional as an attribute, the pair of 
dots was always vertical. For the four 
relative discriminabilities, the dots were 
rotated clockwise or counterclockwise in 
the following amounts: O1, 1.5°; O2, 
4.7°; O3, 14.5°; and O4, 45°, 


EXPERIMENT I: Frer 
CLASSIFICATION 
Method. 


Subjects—A total of 24 male under- 
graduates was randomly selected from the 
elementary course in psychology at Johns 
Hopkins University. Each S participated 
singly in all phases of the experiment, which 
required two 1-hr. sessions. : . 

Stimulus sets.—All possible sets involving 
four stimuli (two attributes) and eight stimuli 
(three attributes) were used. There are three 

airs of attributes, PD, PO, and DO, and 
p h attribute of each pair has four relative 
discriminabilities, which were used in all 
possible combinations. So there were 16 sets 
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for each pair of attributes, making 48 sets of 
two-attribute stimuli. 

All relative discriminabilities were also 
used in all possible combinations for the threc- 
attribute stimuli, making 4 X 4 X 4 = 64 
stimulus sets of eight stimuli each. ‘I hus 
there was a total of 112 sets of stimuli used in 
this experiment. . 

Task.—The task of S was to classify or 
sort each set of stimuli into two groups of 
equal size. The S was informed of the general 
nature of the experiment, and the stimuli 
were demonstrated to him. He was also 
informed that there was no correct method of 
classification, so that he was to classify only 
in a way which seemed reasonable to him. 

The recorded measure was the nature of the 
classification, which could ordinarily be de- 
scribed simply by specifying the attribute 
which was used as the basis of differentiation. 

Stimulus presentation.—Since the experi- 
mental measure obtained in this experiment 
was a free choice of method of classification, 
we attempted to minimize artifactual in- 
fluences in the choices, such as could occur 
either due to the temporal sequence of the sets 
or to physical arrangement of a particular set. 

Both four- and eight-stimulus sets were 
intermixed in the sequence, such that at the 
end of each 28 trials, 12 four-stimulus an 
16 eight-stimulus sets had been used. In 
addition, similar sets were kept apart in " 
sequence, and each SS had a different order o! 
presentation, arranged to counterbalance for 
the order with which particular attributes 
were used. " 

When a four-stimulus set was presented, it 
was arranged before S in a square so that à 
vertical separation of the stimuli corresponded 
to one attribute's dichotomy, and a horizonta 
separation corresponded to the other attr 
bute’s dichotomy. The attributes were 
assigned to the horizontal and vertical posi- 
tions in a counterbalanced manner. : f 

The arrangement with eight stimuli was O 
necessity somewhat more complex. T he 
stimuli were placed before S in two rows © 
four each, and two of the attributes could len 
selected by selection of columns, but the e 
attribute had to be selected by selection © 
two stimuli from each row, not in the same 
column. Again, the important consideration 
is that the different attributes were assigned 
to the different positions in an order counter- 
balanced both within and between Ss. 


Results 


Preference for attributes —Since each 
attribute was paired equally often 
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TABLE 1 


Precint Cromer ol 


NVirgmnvrE ny THREE Tyres ok Ss iN liti CLASSIFICATION 


Attribute Chosen 


S Type | oN : 

: | Distance | Orientation | Position | Other 
D preferring 122 | 69.0 | 22.3 | 5.3 | 3.4 
O preferring 8 22.8 61.9 | 12.2 3.1 
DO preferring 4 39.5 40.0 19.0 1.5 
All Ss 24 48.7 38.4 9.9 3.0 


Note.—These percentages represent choices from all two- and three-attribute sets. 


with all other attributes, in both two- 
and three-attribute sets, and with all 
relative discriminabilities, the overall 
percentage of times that each attri- 
bute served as the basis of the free 
classification can give some indication 
of the average preference for each of 
the three attributes. These average 
Percents, for all Ss, are shown in 
Table 1, and indicate that Distance 
was chosen nearly half the time, 
Hentation over a third of the time, 
ie Position less than a tenth of the 
si Ss Classifications not based on a 
ngle attribute occurred only 3.096 
of the time. 
gus periment I I, reported later, shows 
t at the relative discriminabilities of 
7 € three attributes are all nearly 
are eco in terms of speed of dis- 
: mination, a fact which allows us to 
Onclude that these differences in per- 
Cent choices of attribute do reflect 
true differences in preference for the 
attributes, 
Individual differences in preference. 
~ Even though there exists an overall 
ference in preference for the three 
ș tributes, there are quite striking 
orlividual differences in the pattern 
t these preferences. Table 1 presents 
ese individual differences in one 
Possible way. The total choices for 
a S were determined, and if an S 
1 ose one of the three attributes more 
ranni 50% of the time, he was labeled 
S preferring that attribute. Twelve 


of the Ss chose Distance more than 
half the time, and the pattern of all 
their choices is shown in Table 1. 
Eight Ss chose Orientation more than 
half the time, and their choices are 
also shown. Four Ss chose no single 
attribute more than half the time, but 
the pattern of their choices makes it 
clear that they preferred either Dis- 
tance or Orientation to Position, and 
they have been so labeled. 

It is of particular interest that 
Position was chosen so seldom. The 
strong lack of preference for Position 
as an attribute can be shown also by 
considering the order of preference of 
attributes for each individual S. 
With three attributes, there are six 
possible orders of preference. The 
two orders with Position most pre- 
ferred never occurred. The preference 
order of D» O0» P occurred 13 
times; the preference order of O 
> D > P occurred 9 times, and the 
other two orders occurred once each. 
In other words, Position was the least 
preferred attribute by 22 of the 24 Ss, 
and it was second least preferred by 
the other 2. 

The individual differences in prefer- 
ence, then, are concerned almost ex- 
clusively with whether Distance or 
Orientation is the most preferred at- 
tribute, and all but four Ss are easily 
placed in one or the other of these two 
types. Additional data analyses will 
be presented only for the D-preferring 
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PERCENT CHOICE OF ATTRIBUTE 


RELATIVE DISCRIMINABILITY 


Fic. 1. Effect of rel 
(The abscissa indicates the rel 
attribute and shown in the fi 
indicated attribute over 
Ss; two-attribute sets only. 
difference between the levels o 


gure. 


f an attribute 


5s, but the pattern of results is the 
same for the O-preferring Ss, except 
for the consistent change in preference. 

Relative discriminability.—The rela- 

tive discriminability of an attribute 
does have a striking effect on 
probability of choice as the b. 
classification, as shown in Fig. 
this figure only choices from p 
attributes are Shown, and there 
curve for each possible Choice, 
each curve, the relative discrim 
bility of the chosen attribute is shown 
on the abscissa, and the ordinate 
shows the percentage of choices aver- 
aged over all relative discriminabilities 
for the attribute not chosen. 

These curves group themselves into 
two groups of three. The top three 
curves are for the three pairs in which 
the chosen attribute is also the pre- 
ferred attribute, as measured by total 
choices. The bottom three curves are 


its 
asis of 
1. In 
airs of 
is one 
For 
ina- 


ative discriminability on choice of an 
ative discriminability of each attri 


attribute in free classification. 
ibute as paired with each other 


The ordinate represents the proportion of choices of the 
all discriminabilities of the other attril 


Note that relative discriminabil 


bute. Data from phe 
ity is defined by the physica 


, as indicated in the text.) 


for the pairs in which the nonpreferred 
attribute is chosen, and these choices 
occur much less often. Even for these 
nonpreferred choices, the relative dis- 
criminability does affect the proba- 
bility of choice of the attribute. 

Thus the relative discriminability of 
the chosen attribute affects its choice, 
but so also does the relative discrimi- 
nability of the nonchosen attribute. 
Figure 2 shows the nature of this inter- 
telated effect. For this figure the 
data have been combined in another 
way. For each pair of attributes, the 
number of times that the preferred 
attribute was chosen was counted, and 
the resultant percent figures were 
averaged for all three pairs of attri- 
butes, but with the relative discrimina- 
bility of both attributes kept separate, 
as plotted in Fig. 2 
. This figure shows that an increase 
In the relative discriminability of a 
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preferred attribute does increase its 
probability of choice, and this relation 
holds for all relative discriminabilities 
of the nonpreferred attribute. On the 
other hand, an inerease in the relative 
discriminabilitv of the nonpreferred 
attribute also increases its probability 
of choice, but in this case it has an 
effect only when the relative discrimi- 
nability of the preferred attribute is 
low. In other words, if the preferred 
attribute has a high enough relative 
discriminability, it will be chosen re- 
gardless of the relative discrimina- 
bility of the nonpreferred attribute. 
The preference for an attribute seems 
to give it a priority of choice which 
will operate whenever the discrimina- 
bility of the preferred attribute is high 
enough. 

Number of competing altributes.—M 
a set of stimuli differs in only one 
attribute, then classification will occur 
with respect to that attribute. In 
other attributes are added to the 
stimuli, thev can be considered to be 
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competing attributes in that they will, 
given sufficient preference and dis- 
criminability, interfere with the choice 
of the original attribute as the basis of 
classification. Figure 2 can be inter- 
preted as showing the effect of amount 
of interference produced by one com- 
peting attribute with different relative 
discriminabilities. And that figure 
shows that if the preferred attribute 
has a low enough discriminability, and 
the competing attribute a lfigh enough 
discriminability, then the nonpre- 
ferred attribute can interfere to the 
extent of being chosen more often than 
the preferred attribute. 

When choices are made from among 
three attributes rather than two, in 
effect there is an additional competing 
attribute. The obtained percentage 
of choice of each attribute when all 
three attributes are used together is 
shown in Fig. 3, as a function of the 
relative discriminability of the chosen 
attribute. These data show that 
Distance is still clearly the most 
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preferred attribute, with Orientation 
Next most preferred and Position least 
preferred. Furthermore, an increase 
in relative discriminability increases 
probability of choice for all three 
attributes, as it does for pair choices. 
We would like to know whether 
there is any additional interference 
effect when all three attributes are 
combined, over and above the extra 
interference to be expected by the 
simple addition of another competing 
attribute. This question can be 
answered by comparing the percent- 
age of choices shown in Fig. 3 with 
values expected on the assumption 
that the total interference is simply 
the composite of that produced by the 
two competing attributes acting inde- 
pendently. These theoretical values 
are shown as the filled circles in Fig. 3. 
The theoretical values were ob- 
tained by multiplying the two appro- 
priate probabilities of pair-choices, 


nability on choice of an attribute in free cl 
ng in three attributes, (Data from D-p 
as indicated. 

represent values expected 


ssification 

i y: Zac "Cds 
referring Ss only. Each curv E 
The open symbols represent actual data, ane 
on the assumption of independence of 


obtained from Fig. 1, although sepa- 
rately for each S. To illustrate, with 
the composite percents, Distance with 
relative discriminability of 2 is chosen 
over Position 87% of the time, and 
over Orientation 81% of the time. 
The product of these two values is 
70%, which is the value to be ex- 
pected for choices of Distance with 
this relative discriminability over all 
possible combinations of discrimina- 
bilities of Orientation and Position. . 

The rationale for this calculation is 
as follows: Suppose a single S has 
chosen D2 over P1, P2, and P3, but 
Dot over P4; ang suppose he has 
chosen D2 Over O1 and O2, but not 
over O3 and O4, His percentage of 
choices over the four possible values 
of P is 75, and his percentage of 
choices over all possible values of O 
is 50. Now when D2 is used in a 
three-attribute set, it is paired with 
the 16 possible combinations of values 
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of discriminability of Oand P. D will 
be chosen only if P has a relative dis- 
criminability of 1, 2, or 3 and O has a 
relative discriminability of 1 or 2. 
Only 6 of the 16 combined conditions 
Satsly this requirement—and this 
equivalent proportion is obtained by 
Multiplying the two percent figures 
obtained from the pair choices. The 
values plotted in Fig. 3 were obtained 
by carrying this calculation out for 
each S separately, and then averaging. 
If the pooled data of Fig. 1 are used, 
the expected values are very slightly 
smaller, 


Overall, the comparison between 
these theoretical values and the ob- 
tained values is very close, and we can 
Conclude that there is no substantial 
additional interference produced. by 
Combining competing attributes, over 
Arel above the simple combination of 
Their Independent. interfering. effects. 
Gann effect we obtained no addi- 
ees Information from the experi- 
that oe three attributes over 

,. Ovtained from the pair choices. 
"t ee effects. —At the beginning of 
imis ee Ss are relatively un- 
hei; mA ith the attributes, and with 
Possible d discriminabilities. One 
amiiliar: Consequence of this lack of 
tribute my is that preference for at- 
d ln would be less important than 

inability. 

hase. the course of the experiment 
f Was an increase in the relative 
x ies of the preferred attribute. 
istan first 10 trials, these Ss chose 
ositi ice 60%, Orientation 31%, and 
lon 9% of the time. In the last 
imus these figures had inm 
time, a to 79, 19, and 2% : the 
Cca; hus the preferred attribute 
Ne even more preferred, and the 


Pre 


s i 
ch... Preferred attribute was hardly 


Sen at all at the end of the 
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EXPERIMENT I]: CONSTRAINED 
CLASSIFICATION 
Method 


Subjects.— l'welve Ss participated in this 
experiment, all of them selected from the first 
experiment, Six of them were Distance- 
preferring Ss, and six were Orientation-pre- 
ferring Ss, although one of these latter Ss 
did not complete the experiment. 

Task.—ln this experiment the task of S$ 
was to sort a deck of 32 stimulus cards into 
two piles differentiated by an attribute speci- 
fied by E. The deck of cards was handed to 
S and he dealt one card at a time off the top 
and threw it to the right or left. The S was 
told to make no errors, but to work as fast as 
possible. Both time and errors were recorded 
by E, and the time was told to S after each 
trial. 

Stimulus sets—The stimuli in each deck 
differed from each other in either one, two, or 
three attributes. In the latter two cases, 
sorting was required for each possible attri- 
bute, so that each attribute was at times 
relevant for sorting. 

If all attributes are combined in all relative 
discriminabilities, and sorting is required for 
each attribute, there are 12 conditions of 
single attributes, 96 of paired attributes, and 
192 of triple attributes, making a total of 300 
possible experimental conditions. These 
were reduced to 150 conditions by never using 
a relative discriminability of 3 for an irrele- 
vant attribute unless the relevant attribute 
had the same discriminability, and by never 
using conditions where the relative discrimina- 
bilitv of the irrelevant attribute was consid- 
erably less than that of the relevant attri- 


bute. 

Order of presentation —A different random 
series of these 150 conditions was assigned to 
each S. Each series was divided into five 
groups of 30 decks of stimuli, and each group 
of 30 was run in 1 hr, An initial practice hour 
was used, however, in which the decks from 
the fifth experimental hour were used as 
practice decks. 

Control trials.—An additional deck of cards 
was used to provide us with a measure of 
changes in sorting speed either over the course 
of the experiment or within each 1-hr. 
session. This control deck consisted of 32 
cards on each of which was a 10-mm. circle 
in the middle. The circle was red for half the 
cards and blue for the other half, and the two 
colors were highly discriminable. The Ss 
were required to sort these cards by color. 

At the beginning of each session, three 
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TABLE 2 


CORRECTED SORTING TIMES FOR THREE 
ATTRIBUTES AND TWO TYPES OF Ss 


Attribute 


S Type 


| Iz on] 
Distance | Orientation | Position| M 
| 


9.56 | 9.01 


1 ferri 7.60 

D preferring 6.44 Si j E 

[9] Hope. 8.37 11.00 | 10.41 
M 7.41 


Note. ach score is the mean corrected sorting 
time for all sets and Ss under that condition. 


runs on the control deck were used for 
warm-up, and no measurement was made of 
time. Then two more measured runs of the 
control deck were made, followed by 10 ex- 
perimental decks; then two measured runs 
of the control deck were made after each 10 
experimental decks, 

Corrected sorting time-—For each S, his 
control trials were plotted over the course of 
each session and a smoothed curve drawn. 
Then the sorting times required for cach ex- 
perimental condition were corrected by sub- 
tracting the control time from the obtained 
time. 

These precautions to provide control trials 
turned out to be somewhat unnecessary, since 
an analysis of variance of the times obtained 
with the control trials showed no significant 
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and all interference conditions.) 
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Effect of relative discriminability on speed of constrained classific 


R. GARNER 

effects either between sessions or within 
sessions. Apparently the 1-hr. practice 
session and the warm-up trials stabilized per- 
formance. There were, however, large and 
significant individual differences in the control 
times, so that the use of the corrected sorting 
times does give us a reduced inter-S varia- 
bility, and somewhat greater precision. for 
some of the statistical tests, 


Results 


Efect of preference.—Vrom the first 
experiment we had established a 
preference for attribute in free classi- 
fication, and in this experiment used 
both Distance-preferring and Orienta- 
tion-preferring . Table 2 shows the 
mean corrected sorting times for these 
two groups of Ss, for each attribute. 
An analysis of variance was carried 
out on these data, using each S's 
average score for cach attribute as the 
unit of analysis. This analysis shows 
that the difference in the mean scores 
between the two groups of .Ss is not 
significant, due to the large inter-5 
variances. But more importantly, the 
interaction between groups of Ss and 
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@ With competing attributes 
O With no competing attributes 


| 


CORRECTED SORTING TIME 
IN SECONDS 


3 4 


RELATIVE DISCRIMINABILITY 


Fic. 5. Effect of relative discriminability on speed of constrained classification with and 


wi : " 
thout the presence of competing attributes. 


(Each plotted point is the average for all 5s, 


attri ia 1 i 
ibutes, and for both one and two competing attributes.) 


rl: does not approach signifi- 
eters Thus the fact that some Ss 
THEA, s classify by Distance does not 
oy a they sort relatively faster 
to dies ws ibute than Ss who prefer 
then, [teste by Orientation. There is, 
and ye EE e relation between preference 

S performance.? 
dect P average, there is a significant 
Provide attribute, such that Distance 
© th €s somewhat faster sorting than 
results other two attributes. | This 
Unequ be shown to be the effect of 
the Er relative discriminabilities of 

"attributes, 

eid discriminability.—Vigure 4 
each mean corrected sorting time for 
Telati attribute as a function of the 
tve discriminability of the attri- 


is, of course, logically possible that Ss’ 
Nees, as exhibited in the first experi- 
ind disappeared in the time required 
Cems ime the second experiment, but this 
Were ighly unlikely since the preferences 
actual] ble over two 1-hr. sessions, and had 
of Y increased in severity over the course 


rst experiment. 


2 
It 
Prefere 
ment, 


bute. Since S preferences do not affect 
sorting speed, data for both groups of 
Ss and for all conditions have been 
averaged for these curves. 

It is apparent that relative dis- 
criminability has a very considerable 
effect on sorting speed. These curves 
also show why Distance provides 
faster sorting on the average than the 
other two attributes. We had at- 
tempted to make each level of relative 
discriminability to be comparable for 
all three attributes, and clearly suc- 
ceeded except for the lowest relative 
discriminability. Since the differ- 
ences at the three highest relative 
discriminabilities are not significant, 
we cannot conclude that Distance is 
an inherently better attribute for con- 
strained classification purposes. 

Number of competing attributes.— 
Figure 5 shows mean corrected sorting 
time as a function of relative dis- 
criminability for the cases where 
there was only one attribute and for 
all combined cases where there were 
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competing attributes. Analysis of 
variance shows no significant differ- 
ence between these two conditions, so 
that we can conclude that the presence 
of interfering attributes has certainly 
little effect at best on constrained 
classification—a result again unlike 
that found with free classification. 

Error analysis.—The total number 
of errors made was too small to allow 
detailed analysis, but what effects 
appear at all simply confirm the re- 
sults obtained with time measures. 
For example, more errors are made 
with the lowest relative discrimina- 
bility than with the higher discrimina- 
bilities. There is, then, no evidence 
that conditions giving faster sorting 
speeds did so by the production of 
more errors. Thus there was no com- 
pensating effect, 


Discussion 


The major Purpose of these experi- 
ments was to compare factors affecting 
performance of two different types of 
classification task, with the particular 
intent of determining whether it is pos- 
sible to determine a special property of 
some attributes compared to others 
Which will make them preferred. A com- 
parison of tasks for each of several factors 
will facilitate an understanding of the 
difference between tasks, 


Relative discriminabilit 
periments show that as the difference 
between the two levels of an attribute 
increases, that attribute is more apt to be 
chosen as a basis of classification in free 
classification, and that sorting on the 
basis of that attribute is faster. This 
result for the constrained classification 
task is not new, since Reed (1951) had 
shown this result for sorting by attributes 
of visual area and hue. 

The fact that free classification is also 
severely affected by the relative dis- 
criminability of the attributes makes 
very clear that it is not possible to talk 
ibout the properties of a particular at- 
tribute without consideration of relative 


y—These ex- 
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discriminability. Thus it is not sensible 
to say, for example, that concepts (classi- 
fications) can be formed more casily with 
color than with size unless we know that 
these two attributes are, in the 
situation, equally discriminable. 

Competing attributes.—The lack of any 
effect of number or discriminability of 
competing attributes on sorting speed is 
a little surprising, but quite in keeping 
with previous research. Various Es on 
sorting tasks have found either no differ- 
ence due to the presence of competing 
attributes (Archer, 1954; Morin, Forrin, 
& Archer, 1961), or else very slight 
differences (Crossman, 1953; Gregg, 
1954; Reed, 1951). The lack of effect in 
our experiment was surprising only in 
that we used more extreme conditions of 
interference. For example, some of the 
sorting conditions required sorting by an 
attribute with relative discriminability 
of 1, against two attributes each with a 
relative discriminability of 4. Even 
under this circumstance, sorting was not 
impaired by the presence of the com- 
peting attributes. 

In the free classification task, the 
problem of competing attributes isa little 
different. If a set of stimuli differs on 
justa single attribute, then classification 
can occur only with respect to that 
attribute. But interference must occur 
if two such attributes are combined into 
a single stimulus set, since classification 
cannot occur by both of them simultane- 
ously. However, the interfering effects 
found with free classification concern 
much more than this inevitable result, 
Since the nature of the choices is modified 
by the relative discriminability of the 
Competing attribute, Thus in free 
classification, the discriminabilities of all 
attributes enter into the result, whereas 
in constrained classification only the dis- 
ctiminability of the relevant attribute 
affects performance, 

Preference for attributes.—The logical 
Problem of this Paper has been to deter- 
mine whether we can say that one 
attribute is preferred over another (or is 
more salient, easier to use, etc.). We 
need to know that the attributes being 
compared are the same with respect to 


given 
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other properties which could affect the 
choice, and the property we are most 
concerned with is that of discriminability 
of an attribute. 

Since in the free classification experi- 
ment there were substantial individual 
differences in choice of attribute, we have 
some confidence that the preference is 
not an artifact. It seems unlikely that 
differences in discriminability would 
display the same individual differences as 
the differences in preference, so that the 
differences in preference are probably 
real and exist over and above any 
differences in discriminability. 

A major purpose of the constrained 
Classification experiment was to provide 
à means of determining whether the 
three attributes were equally discrimi- 
nable at cach of the four levels of relative 
discriminability we chose. Our results 
showed that speed of sorting was the 
Same for all three attributes except at the 
Owest relative discriminability. In order 
to determine whether this difference at 
this lowest level of discriminability could 
have affected the preference measures, we 
tabulated choices for all cases where none 
die diebus had its lowest level of 
iat ^ iia RON lhis tabulation showed 
Ss Pain| erence pattern for individual 
in ference the same, but the differences 
Segre ence within each group of Ss was 
be ik : This result is consonant with 
indicat ation shown in Fig. 2, which 
iS dika that if the preferred attribute 
teataja able enough, it will be chosen 
Som ess of the discriminability of the 

Peting attributes. 

"Us if we can accept the speed of 


Sort an 
cri ting as a legitimate measure of dis- 
„Minability, then we can say that there 


exist 
otl 
no 


Preferences for some attributes over 
aers, and that these preferences are 
t due to differences in discriminability. 
shows Prison of characteristics.—Table 3 
the 4:8 Very useful way of summarizing 

he tel erences in these two tasks, and 
b ative role of preference for attri- 
© and discriminability in these two 


ta, 
istics Each of the 12 stimulus character- 
our | (defined by three attributes and 


*vels of relative discriminability) 
ranked in this table according to 
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TABLE 3 
RANKINGS OF STIMULUS CHARACTERISTICS 
AND CONSTRAINED 
SIFICATION FOR Two. 
Types or Ss 


Free 
Classification 
Rank = — 
D o D [o] 
Preferring | Preferring | Preferring | Preferring 
1 D4 O4 D4 D4 
2 DS O3 P4 P4 
3 O01 02 O4 O4 
+ D2 D4 D3” D3 
5 O3 P+ P3* 03 
6 02 D3 O3 P3 
i D1 P3 D2 02 
8 P4 Ot P2 D2 
9 P3 D2 02 P2 
10 Ot pe D1 Di 
11 P2 D1 Ol P 
12 PL P1 P1 O1 


Note.— The letter indicates the attribute and the 
number following it indicates the relative discrimina- 
bility. A lower rank number indicates higher frequency 
of choice or faster sorting. 

a Tied ranks. 


their effect on performance in each task 
and for each group of Ss. For the con- 
strained classification, the ranks are 
obtained simply by determination of 
mean sorting time for all conditions in 
which that stimulus characteristic pro- 
vided the classification criterion. The 
correlation between rankings (rho) for 
the two groups of Ss in constrained 
classification is .96, indicating that the 
type of S does not affect sorting perform- 
ance. But simple inspection makes the 
predominant role of relative discrimina- 
bility even clearer, since in no case does 
an attribute provide faster sorting than 
another when it has a lower relative 
discriminability. Ranking with respect 
to relative discriminability is perfect. 
The rankings for free classification 
were a little more complicated to obtain, 
since there exist no direct comparisons 
between levels of relative discrimina- 
bility for the same attribute. We con- 
structed a pair-comparison matrix for all 
paired attributes and conditions, and 
completed the matrix by assuming that 
if two levels of relative discriminability 
on the same attribute had been paired, 
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the more discriminable attribute would 
always have been chosen. 

The correlation between ranks for the 
two groups of Ss in free classification is 
.63. If, however, we exchange O and D 
characteristics for the O-preferring Ss, 
to reflect the difference in preference, 
then the correlation becomes .90. In- 
spection again shows the great impor- 
tance of the particular attribute in free 
classification. To illustrate, Distance- 
preferring Ss choose the least discrimi- 
nable level of Distance over the most 
discriminable level of Position. 

To summarize, in constrained classi- 
fication only the discriminability of the 
criterion attribute is important in speed of 
sorting. In free classification, discrimi- 
nability of the chosen attribute, discrimi- 
nability of interfering attributes, and a 
preference for attribute over and above 
these factors all affect performance. 
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REACTION TIME TO CHANGES IN THE INTENSITY 
OF WHITE NOISE! 


DAVID H. RAAB asp MITCHELL GROSSBERG 


Brooklyn College of the City University of New York 


Reaction time to a noise burst (AD) added to ongoing noise (I) was 


found to decrease with increasing AI and with decreasing I. 


For 


constant values of AI/I, RT generally decreased with increasing I—a 
finding of significance for decision models of RT. 


The current literature on psycho- 
acoustics includes a number of papers 
in which signal-to-noise ratio, de- 
tectability, loudness, and reaction 
time are related to each other. The 
d' index of detectability—as developed 
by Tanner, Swets, and Green—is 
equal to ¥2E/No for a pure tone signal 
known exactly to an “ideal observer” 
(see Green, 1960b). (E = signal 
energy; Ny = noise power density.) 
When the signal is itself a burst of 
Noise, d’ is a monotonically increasing 
function of the ratio of signal power to 
noise power (Green, 1960a). Creel- 
Man (1963) has shown that tone 
ursts of equal loudness have identical 
d indexes. McGill (1963) has out- 
lined. à detection model of simple 
reaction time (RT) and has inferred 
Tom the model that RT and loudness 
are inversely related and that RT 
distributions are identical for signals 
that are equally detectable. 

McGill cites Greenbaum’s (1963) 

ata in support of the second infer- 
ence, Greenbaum measured RTs to 
the Onset of a 1,000-cps tone masked 

Y continuous noise. Signal-to-noise 
ratio (signal level: noise spectrum 
vel) was fixed at approximately 30 

Similar RT distributions were 


(Gp his research was supported by a grant 
tio 71118) from the National Science Founda- 
Coll and by funds provided by Brooklyn 
ege. The data were collected by Robert 
aron and David Charney as part of an 
nors course. 


obtained for four different levels of 
masking noise. 

Actually, as McGill noted, there 
were differences among the four RT 
distributions. Median RTs, for ex- 
ample, decreased systematically with 
increasing mask level; a 10-msec. 
decrease in RT resulted from a 30-db. 
increase in masking noise level (see 
McGill, 1963, Fig. 6). 

Equal signal-to-noise ratios (S/N) 
did not generate equal RTs in Cho- 
cholle’s (1943, 1944) studies with 
tonal signals and tonal masks. In all 
cases, RT was shortened as back- 
eround stimulation was increased in 
intensity. 

Visual RTs have been measured to 
increases (AI) in the luminance of a 
portion of a lighted background (J). 
Decreased RTs at higher levels of 7 
(for constant AI/I) can be seen in 
studies by Piéron (1936), Durup and 
Piéron (1937), Steinman (1944), and 
Bartlett and MacLeod (1954). 

In the present study, simple RTs 
were measured to noise bursts added 
to background noise. Background 
intensities—all within the region de- 
scribed by Weber's law—covered a 
range of 60 db.; S/N was varied 
between 0.63 and 107, i.e., between 
—2 db. and 70 db. Miller (1947) has 
analyzed the masking of noise by 
noise and has noted that this instance 
of masking is operationally identical 
to intensity discrimination; masking 
noise (N) and background intensity 
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(I) are the same, and signal intensity 
(S) equals increment intensity. (AI). 
The effect of wave-form correlation on 
these relations has been discussed by 
Pfafflin and Mathews (1962) and by 
Raab, Osman, and Rich (1963). 


METHOD 


Apparatus.—Masking and increment noises 
were generated and band-limited (100-6,000 
cps) in separate channels. They were 
switched independently and attenuated in- 
dependently before being mixed to produce 
the stimuli for each trial. The acoustic output 
of S's earphone (PDR-10 mounted in a 
MX/41-AR cushion) was measured by a WE 
640-AA microphone in an ASA Type 1 
coupler. Sound-pressure levels (SPLs) given 
in this report are all referred to 0.0002 
dynes/cm?, 


A Berkeley 7350 EPUT counter was used 
to measure RTs, 

Procedure.—Four levels of masking noise 
(25, 45, 65, and 85 db, SPL) were used. 
Increment signals were varied from 23 to 95 
db. SPL. For the 25.db. mask, therefore, 
relative increments (AI/I) ranged from —2 
db. to 70 db.; for the 85.db. mask, AI/I 
ranged from —2 to 10 db. These stimuli 
were delivered through one earphone. 

Reaction-time trials were presented every 
12sec. Each trial began with the presentation 
of the masking sound. The increment noise 
was added to the mask (and the RT counter 
started) 2.0, 2.25, or 2.5 sec. after the onset 
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of the mask. The counter was stopped when 
S depressed a normally closed telegraph key. 
Mask and increment noises were terminated 
together 3.2 sec. after the onset of the former 
—half a second or more after S had responded. 

Experimental sessions consisted, in general, 
of four blocks of 28 trials each. Each of the 
28 different mask-increment combinations 
was presented once in a test block. Since E 
and S changed roles after each block of trials, 
the duration of cach block and the interval 
between successive blocks (for cach S) was 
approximately 6 min, Four practice trials 
were presented before the first block of each 
test session; the three remaining blocks of 
cach session were cach preceded by onc 
practice trial, 

Subjects.—' l'hree Practiced Ss were used. 
All three also served as Es. 


RESULTS AND Discussion 


Twenty test sessions with each S 
provided mean RTs based on 80 re- 
Sponses. Standard deviations of the 
distributions were also computed. 
They are of the order of 20 msec. and 
include intertrial variations within a 
test session as wel] as day to day 
variability. Since the principal trends 
are the same for the three Ss, group 
means are plotted in Fig. 1 and 3. 

Figure 1 shows RT to decrease with 
increasing increment intensity. The 
general ordering of the curves shows 


RT TO CHANGES IN NOISE INTENSITY 


that RT is also a function of masking 
sound pressure. The dependence of 
RT on both background (J) and 
increment (AZ) agrees with the results 
of the visual and auditory experi- 
ments cited earlier. 

Of special interest are the relations 
between RT and AI,I. These are 
given by the plots in Fig. 2. For 
relative increments of 10 db. or more, 
the curves are ordered such that RT 
to a given relative increment is shorter 
at higher intensities of masking noise. 
These data confirm—in the case of 
noise added to noise—what has al- 
ready been reported (see above) for 
light added to light, clicks added to 
noise, and tones added to tone or 
noise. Equal S/N ratios do not 
everywhere generate equal RTs. 

That RT to a masked stimulus 
Shows a kind of "recruitment" ap- 
Pears, at first glance, to contradict the 

etection Theory of Tanner, Swets, 
and Green. Actually, the finding that 
akong latency is not determined 
t *- y by S/N is not inconsistent with 
of phen that detectability isa function 
Es / N. The O in a detection experi- 

ent is asked to decide about the 
presence or absence of weak signals in 
oise. For clearly detectable stimuli 
(S >), on the other hand, loudness 
Judgments, for example, are not 
SPecified solely by S/N (Gleiss & 
degree 1964; Lochner & Burger, 
n Raab & Osman, 1962). Creel- 
lan’s (1963) finding that short tones 
approximately equal energy are 
Oth equally detectable and equally 
Sud applies only to weak signals 
@’ & 4. 
" À similar situation obtains in the 
ase of auditory and visual RT. The 
passent results—together with those 
ae Several prior studies—show that 
M us fixed ratios of AI/I is shorter 
igher intensities of J. If decision 
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models are to be used to describe 
simple RT to partially masked stim- 
uli, they will have to be different 
from the detection models currently 
employed. i 
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HABITUATION OF ALTERNATION 


BEHAVIOR! 


P. E. FREEDMAN 2 


University of Iowa 


Alternation behavior of rats in a 2-alternative maze was investigated in 


2 experiments. 


Neither number of immediate prior exposures nor the 
presence of choice were found to be significant variables. 


However, 


brightness was shown to be an important factor in establishing a 


"chance" performance criterion. 


The demonstration that alternation 


habituates over trials was considered in respect to its effect on learning 


theory. 


Alternation behavior, the tendency 
for an organism, after responding to 
one alternative of a multiple alter- 
native situation, to respond to another 
alternative when given a second op- 
portunity, is a well-established phe- 
nomenon in animal maze studies. 
However, the implications of the 
alternation phenomenon for learning 
theory generally have been over- 
looked. Even when the initial ex- 
bosure is reinforced the tendency to 
alternate on subsequent trials is 
Significant early in training (Estes & 
»choeffler, 1955; Montgomery, 1952; 
Rothkopf & Zeaman, 1952). A major 
9 n question is: as learning 
Sven but does alternation habituate 

trials or does it remain as an 
9Pposing tendency? 
bn des previous studies bear directly 
Is question. Walker (1956) gave 
gn pretest exposure (PTE) trial 

Owed by one test trial each day 
and obtained a level curve of response 
alternation over a series of 7 days. 
Sen level of alternation was signifi- 

tly greater than 50%. Fowler, 
is. his experiment represents a portion of a 
I men submitted to the Teychology 
i Partial ta of the State University ol í oye 
the p ulfillment of the requirements fo 
acknowled degree, The ani kor ees "d 
he Ur cede ce Em abmeupinét 
so en Hal at 

?N of the investigation. 
Labora, at United States Army 

Ories, Natick, Massachusetts. 


Natick 


Blond, and Dember (1959), using two 
and eight forced PTE trials per day 
prior to a test trial, found initially 
high percentages of alternation with a 
decline over days, with a somewhat 
more rapid decline for the eight PTE 
trial Ss. At the end of 10 days both 
curves appear to level off slightly 
below the 50% alternation level. For 
the results reported, all trials were 
reinforced. 

Walker's study would indicate that 
alternation behavior does not habitu- 
ate. Fowler et al. are equivocal on 
this point since the decline in alterna- 
tion could be attributed to a greater 
habit strength being progressively 
built up to the PTE brightness. 

If the alternation tendency does not 
habituate, the following assumptions 
and predictions would be made in a 
two-alternative situation: (a) If S is 
forced to one of the stimuli, e.g., 
response to white (Rw), on a PTE 
trial and is reinforced, there is an 
increment in the habit strength of 
Rw, and a satiation or exposure factor 
to the white stimulus (Sw) is devel- 
oped. Since the satiation factor is 
initially stronger than the learning 
factor, there is a greater tendency for 
S to respond to the nonexperienced 
black stimulus (Sb) on the test trial. 
If S is reinforced for this latter re- 
sponse, an increment in its habit 
strength results, and both responses 
would then have equal habit strengths, 
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Repeating this procedure over a 
period of days, one PTE and one test 
trial per day, the curve representing 
percent alternation over days should 
remain high and stable. (b) If two 
or more forced PTE trials to Sw 
precede the test trial each day, the 
difference over days between the 
habit strengths of Rw and Rb would 
increase due to different number of 
increments in the learning factors. 
Consequently, the habit strength 
of Rw progressively would become 
stronger than that of Rb over a series 
of days, and the percent alternation 
curve should decline. However, since 
Rb would still be made occasionally, 
over extended training the habit 
strengths of Rw and Rb should ap- 
proach the same asymptote and hence 
approach equality. Under this condi- 
tion the satiation factor would again 
tend to make S respond to Sb, and the 
curve of percent alternation should 
rise toward its original level. 

The present study tested these 
predictions in a simple two-alternative 
maze. An additional variable in- 
vestigated was the technique of forc- 
ing. Studies on alternation varying 
number of forced trials have em- 
ployed physical barriers to control the 
stimulus to which .S is exposed on a 
forced trial. The present investiga- 
tion compared the effects of this 
forcing technique with a method of 
controlling stimulus exposure which 
did not employ a physical barrier. 


EXPERIMENT I 
Method 


Apparatus.—A wooden maze was con- 
structed with a 9 X 24 in. start box opening 
into a choice point allowing access to two 
parallel 42 X 33 in. alleys which shared a 
common wall. The last 12 in. of the alleys 
were used as goal boxes. The Plexiglas ceiling 
of the start box was 4 in. high; the ceiling of 
the remainder of the maze was 93 in, high. 
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The start box and choice point were 
painted neutral gray. The alleys were either 
black or white, and sets of alleys could be 
interchanged to present black-white, two 
black, or two white alleys at the choice point. 
A metal insert, painted the same brightness 
as the blocked alley, could be positioned at 
the entrance of an alley for a forced trial. 

Subjects.—One hundred and forty naive 
hooded rats from the colony maintained by 
the Department of Psychology of the State 
University of Iowa served as Ss. At the 
start of the experiment all Ss were between 
100 and 130 days old. 

Maintenance schedule.— l'he Ss were placed 
on a 22-hr. water-deprivation schedule 10 days 
prior to the start of the experiment and re- 
mained on this schedule throughout the ex- 
periment. The Ss were given access to water 
for a 15-min. period beginning 15 min. after 
each daily experimental session. Food was 
available in the living cages at all times. 

Training procedure.—A Ss received one 
or two PTE trials each day under either a 
Free Choice (C) or Forced Yoked (Y) condi- 
tion. Since there was greater initial interest 
in the C condition, 80 Ss were assigned to the 
C condition, and only 60 Ss were assigned to 
the Y condition. Under the C condition 5 
was allowed to choose the right or left alley 
on a PTE trial, but both were the same 
brightness, i.e, white-white or black-black- 
Each S in the Y condition was yoked to one 
of the first 60 of the Ss in the C condition and 
was forced by a physical block to make the 
same PTE response. Within each PTE 
condition half the Ss had one PTE trial, and 
the other half had two PTE trials per block. 
The PTE brightness was black for one half 
of the Ss. After one or two PTE trials each 
day, all Ss were given a test trial where 
could choose between a black-white alter- 
native. For each S black was on the left for 
half the test trials. All trials were reinforce 
with .5 cc of water. The Ss were retained in 
the goal box for 15 sec. The daily series o 
trials for each S was run consecutively. The 


experiment was continued for 40 blocks of 
trials. 


Results 


The response of each S on each test 
trial was scored as 1 if S alternated 
and 0 if S did not alternate: i.e., chose 
the same brightness to which he was 
exposed on the PTE trial(s). The 
data were analyzed with respect to the 


HABITUATION OF ALTERNATION BEHAVIOR 


first 10 test trials individually and the 
last 30 test trials in 5-trial blocks. 

Analysis of variance with Method 
(M), Number (N), and Brightness 
(B) of PTE trials as between factors 
and trials (T) as the within factor 
showed B, T, and B X T interaction 
significant (p < .001) over the first 
10 trials. Over the last 30 trials B 
effects were significant (p < .001) and 
the MXNXB and the TXB 
interactions reached the .05 level. 

Analysis of the first 10 trials within 
PTE brightness showed that only the 
trials effect was significant, both 
within white and within black PTE 
brightnesses (p < .005 and p < .025, 
respectively). There were no signifi- 
cant effects over the last 30 trials in 
cither the white or black PTE bright- 
nesses, The curves of percent alter- 
nation for white PTE and black PTE 
Se presented in Fig. 1 and 2. 

The apparatus minimized differ- 
ential right-left kinesthetic cues. No 
Position preferences were evident in 

s’ responses. 


EXPERIMENT II 


The results of Exp. I which indicate 
that there was no tendency for per- 
Cent alternation to increase with ex- 
tended training lead to the tentative 
conclusion that the alternation tend- 
vut did habituate. However, the 
P lfs do not preclude the possibility 
n the procedures used in Exp. 1 did 
Ot allow the habit strengths of the 
ie vo alternative responses to be equal- 
9r approach equality. Further- 
» even though the overall curve 
Percent alternation approached 
leveled off at about the 5097 level, 
r alternation curves within PTE 
'Bhtness leveled off at about 80% 
reg, 29% for white and black PTE, 
Pectively, Therefore, a second ex- 

‘Ment was designed in which 
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Percent Alternation to White 
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Test Trials ? 
Fic. 1. Percent alternation for the Cr 02. 
Y1, and Y2 Black PTE groups as a function 
of trials. (The first 10 test trials are 
plotted individually. The last 30 trials are 
plotted in blocks of 5 test trials.) 


brightness preference and alternation 
behavior could be recorded while the 
learning factor to each brightness was 
kept approximately equal. 


Method 


A pparatus.—Same as in Exp. 1. 

Subjects —Twenty hooded rats of the same 
age and origin as those in Exp. I served as Ss. 

Maintenance schedule.—Same as in Exp. I. 

Training procedure.—The handling and 
preexperimental treatment of Ss was the same 
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as in Exp. I. All Ss received the same 
experimental treatment. Each S re : d 
four trials per day. On the first two triz s S 
was presented with a "porte Sues 

he left-right positioning of the brightness 
Fs these A trials was BW, BW; W B, WB; 
BW, WB; or WB, BW. Each S received a 
different combination over days. On the last 
two trials S was forced to a particular bright- 
ness by a physical barrier. The barrier was 
painted the opposite brightness from the open 
alley. The brightness to which S was forced 
was determined by his responses on the first 
two trials. In each block of four trials S 
experienced each brightness twice. The order 
and right-left positioning of brightness on the 
forced trials was determined randomly, but 
no S had all left (or right) turns or experienced 
each brightness consistently on a particular 
side in a four-trial block. This training con- 
tinued for 5 days. All trials were reinforced. 


Results 


Considering the first trial of each 
day, of the 100 total opportunities 
black was chosen 80 times and white 
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Fic. 3. Percent choice of black by the 
White PTE Ss of Exp. I (upper part of 
figure) and percent choice of white by the 
Black PTE Ss (lower part of figure) compared 
with initial choice of Exp. II .Ss over trials, 
(The abscissa is plotted in total trials—both 
test and PTE trials. The dotted lines are 
hypothetical extensions of the Exp. II Ss 


choices.) 
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was chosen 20 times. Following an 
initial choice of black, Ss alternated 
(chose white) 16 out of 80 times or 
20% of the time; whereas following an 
initial choice of white, Ss alternated 
(chose black) 17 out of 20 times or 
85% alternation. 

Fifty-five percent of the Ss alter- 
nated on the first day. Alternation 
dropped to 35% on the second day 
and to 25% on the third, fourth, and 
fifth days. 

Discussion 

The findings of previous studies that 
Ss initially alternate at a high level was 
well substantiated by the present study. 
A binomial probability test for all groups 
in Exp. I, with the exception of the 
forced groups with black PTE, showed 
that first day alternation was signifi- 
cantly above 50% (p < .01). Alterna- 
tion, however, declined over days, 
leveling off for the combined brightness 
groups at about the 50% level. Simi- 
larly, in Exp. II, alternation started at 
55% and rapidly dropped to 25%. 

A major problem in determining 
whether or not alternation habituates iS 
the designation of a criterion which 
represents the level of choice § would 
make in the situation without the effects 
of recent prior exposure. This criterion 
will be referred to as the Nonsatiation 
Performance (NSP) level. Most bright- 
ness alternation studies, including the 
present one, have used 50% choice of 
black and white as the NSP criterion. 
However, the results of Exp. II suggest 
that Ss’ empirically determined prefer- 
ences for black and white should be 
Considered in specifying the NSP base 
level. Only one known study (Dember 
& Fowler, 1959) reported a black prefer- 
ence (65%). In Exp. IT of the present 
study the choice behavior on the initial 
trial of each day suggests that Ss 
Preference for the brightnesses used in 
this study were 80% for black and 20% 
for white. Consequently, the results of 
Exp. I were reevaluated using the 80% 


black-20% white preference levels as 
NSP criteria. 
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Phe upper and lower parts of Vig. 3 
present these comparisons. The curves 
for percent initial trial black-white 
choice of Exp. II Ss represent the NSP 
levels. On the early test trials alterna- 
tion (choice of black) for White PTE Ss 
in Exp. I is significantly. above 80% 
($ < .01) and alternation (choice of 
white) for Black PTE Ss is significantly 
above 20% (p < .001) by a binomial 
Probability test. Both curves, then, 
approach and appear to approximate 
their respective NSP levels over the 
remaining trials. The White PTE curve 
Was not different significantly from Trial 
3on (p> .05) and the Black PTE curve 
from Trial 12 on. Therefore, when 
brightness preference is considered in 
estimating the Nonsatiation Performance 
level, there is good evidence that alter- 
Nation is initially present and then 
habituates over trials. 

In Exp. I neither the number of PTE 
trials nor the method of PTE was shown 
to be an effective variable. The latter 
actor was varied in order to investigate 
the finding that there is more alternation 
lowing forced trials than following free 
trials (Dember & Fowler, 1959; Thomp- 
Son, 1962). The failure of this variable 
to produce a difference in percent 
alternation indicates that the element of 
Choice itself is not a factor affecting 
alternation behavior. 

In conclusion, regardless of which level, 
095 or the 80-2095 NSP level, is used 
as a Criterion, alternation on the early 
trials in the present study was found to 
e Significantly above "chance" perform- 
ance, Here it is possible to appeal to any 
of Severa] hypotheses which have been 
Hered to explain alternation: i.e., that S 
Initially has an exploratory drive, curi- 
ity motivation (Montgomery, 1951), 
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or stimulus. satiation (Glanzer, 1953) 
which results in alternation behavior. 
The demonstration in both Exp. Land I] 
that alternation does habituate reduces 
the importance of this behavior in the 
schema of learning theory. However, 
the importance of alternation as a be- 
havioral phenomenon in its own right and 
its possible antagonistic effects on the 
carly trials of learning deserve further 
study. 
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CHILDREN'S PERFORMANCE AS A FUNCTION OF THE 
DEGREE OF VISUAL STIMULUS DEPRIVATION 


AND SATIATION ! 


RICHARD D. ODOM 


University of Minnesota 


112 boys and girls (8 through 9 vr.) participated in a 4 age, lever- 
pressing task with visual stimuli as reinforcers. The preexperimental 
period consisted of a response period with no reinforcers followed bya 
response period with reinforcers. During the experimental period Ss 
made no lever-pressing responses, and 4 conditions involving deprivation 
(reinforcers withheld) or satiation (reinforcers presented) for either 2 or 
10 min. were introduced: D10, D2, S2, S10. The postexperimental 


period consisted of a response period with reinforcers. 
rate of lever pressing increased as degree of satiation decreased 
degree of deprivation increased. ‘The results were interpreted 
support to the notion that visual stimulus depriv; 


act as a source of motivation. 


The concepts of deprivation and 
satiation have frequently been used to 
account for behavior that cannot be 
explained wholly in terms of tradi- 
tional learning variables. Such opera- 
tions involving primary reinforcers 
have been employed in research using 
rats as Ss in investigations of the 
effects of different levels of deprivation 
upon the acquisition of an instru- 
mental response and upon the as- 
ymptotic level of performance (e.g., 
Kimble, 1951; Ramond, 1954). A 
number of studies (Erickson, 1962; 
Gewirtz & Baer, 1958a, 1958b; 
Hartup, 1958) have investigated the 
effects of deprivation and satiation of 
social stimuli on children's subsequent 
performance in tasks where verbal 
approval was the reinforcer. Using 
infrahuman organisms as Ss, Butler 
(1957a, 1957b), Kish and Baron 
(1962), and Fox (1962) found that 


1 This study was supported in part by a 
grant from the National Institute of Mental 
Health (M-03519) United States Public 
Health Service. The author is grateful to the 
staffs of Roosevelt and Ericsson Elementary 
Schools, St. Paul, Minnesota, for their co- 
operation in securing Ss for this study. 


It was found that 
and 
as lending 
ation and satiation can 


effectiveness of certain classes of 
simple visual and auditory reinforcers 
can be modified with deprivation and 
satiation operations. In general, the 
results of the above studies have dem- 
onstrated that reinforcers, whether 
primary, social, or Sensory in nature, 
were more effective following a longer 
period of deprivation than they were 
following a shorter period of depriva- 
tion or a period of satiation. 

More recently, the results of several 
studies (Horowitz, 1962; Stevenson & 
Knights, 1961; Stevenson & Odom, 
1961, in press) have indicated that 
different classes of visual stimuli serve 
effectively as reinforcers for children’s 
behavior. Using children as Ss, Odom 
(1964) found that deprivation and 
satiation operations with visual and 
auditory stimuli influenced the effect- 
iveness of these stimuli as reinforcers 
in a subsequent task. 

If deprivation and satiation are 
considered as points on a dimension of 
reinforcer availability, then the degree 
of deprivation and satiation can be 
varied by increasing or decreasing the 
amount of time in which a potential 
reinforcer is either unavailable or 
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available and responded to by the 
organism. The present study was 
designed to test the effects of degree 
of visual stimulus deprivation and 
satiation on children's performance in 
a simple lever-pressing task. For each 
S there were three stages in the 
experiment: (a) a preexperimental 
period consisting of a response period 
with no reinforcers followed by a 
response period with reinforcers, (5) 
an experimental period (either 2 or 10 
min.), during which reinforcers were 
either withheld (deprivation) or pre- 
sented (satiation), and (c) a post- 
experimental period with reinforcers. 
Sex of S was included as a variable 
since earlier studies (Stevenson & 
Odom, 1961, in press) have indicated 
that visual reinforcers are more effect- 
ive for boys than for girls. 

On the basis of the research pre- 
sented above, certain predictions were 
made concerning the results of the 
present study. Considering both dep- 
rivation and satiation as part of a 
single dimension, it was predicted 
that the rate of responding would 
decrease as a function of decreasing 
degrees of deprivation and increasing 
degrees of satiation. In addition, it 
was predicted that visual stimuli 
would be more effective as reinforcers 
for boys than for girls. 


METHOD 


Subjects —The Ss were 56 boys and 56 
girls from the third and fourth grades of 
Roosevelt and Ericsson Elementary Schools in 
St. Paul, Minnesota. An equal number of 
boys and girls were randomly selected from 
the two grade levels. 

Apparatus: —The study took place in a 
74 X 8 ft. room of a mobile laboratory be- 
longing to the Institute of Child Development. 
The apparatus has been described fully in 
carlier studies (Odom, 1964; Stevenson & 
Odom, 1961). Generally, it consisted of a 
wooden frame upon which a vertical and a 
horizontal panel were mounted. The vertical 
Panel contained a ground glass screen on 
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ee Da projected, „One end 
ES , open-ended tube, 113 in. in length 
was eg to the vertical panel so that it 
s peel nd pas, sereen. Extending from 
second, sim tabe whi d wa ir ME 
fit inside the first tube Ea ni NE a 
to wear the helmet in on puoi. 
availability of visual SEE e m Tae 

lability sual stimuli present in the 
experimental room. 

Centered on the horizontal panel, which 
sloped downward from the bottom of the 
vertical panel, was a vertical wooden lever. 
Fhe apparatus was mounted on a large panel 
which was constructed to fill an opening be- 
tween the experimental room and an ad- 
joining observation room. A chair for S was 
placed in front of the apparatus, and the 
relationship between the chair and the 
apparatus was similar to that of a child’s chair 
and desk in a classroom. 

Located in the observation room was a 
Dunning Animatic filmstrip projector which 
was used to project colors onto the screen. 
The filmstrip contained three random se- 
quences of 10 different colors. Between each 
colored frame of the filmstrip was a blank 
frame which projected a uniform gray onto 
the screen. Also located in the observation 
room was the equipment used to program the 
presentation. of stimuli and to record S’s 


responses. 

Procedure. —Equal numbers of boys and 
girls from each grade level were randomly 
assigned to four groups. All Ss responded by 
pressing the lever during a 4-min. pre- 
experimental period and a 5-min. postexperi- 
mental period. The first 2 min. of the pre- 
experimental period served as a base-rate 
period during which no colors were presented. 
During the last 2 min. of this period and 
throughout the postexperimental period colors 
were presented as reinforcers according to a 
variable ratio schedule with a mean of one 
color for every eight responses. 

The Ss were instructed not to press the 
lever during the experimental period which 
lasted either 2 or 10 min., depending on the 
condition to which S had been assigned. For 
Ss in the two satiation conditions, S10 and S2, 
colors were presented for 10 and 2 min., 
respectively, according to a variable interval 
schedule. These two groups of Ss received a 
color on the average of every 6 sec. Through- 
out the three stages of the study each color 
was presented on the screen for a duration 
of 1 sec. The Ss in the D10 and D2 con- 
ditions received no colors for 10 and 2 min., 


respectively. 
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Each S came individually to the experi- 
mental room. The E began by explaining 
that when the handle was moved during the 
game, different colors would appear on the 
screen. The S was told that he could move 
the handle as quickly or as slowly as „he 
wished and that he could change hands in case 
the one moving the handle got tired. The E 
then presented the helmet, put it over Ss 
head, placed the helmet tube inside the 
apparatus tube, and pushed .5': cha r forward 
so that the lever was easily accessible and so 
that the helmet tube could not be removed 
from the other tube. After S was told that 
the game was ready and to begin moving the 
handle, Æ went into the observation room, 
closing the door behind him. 

At the end of the preexperimental period E 
interrupted S and told him that something 
had happened to the game and that while it 
was being fixed, S was not to move or touch 
the handle. The E then returned to the 
observation room. At the end of the experi- 
mental period E told S the game was ready 
to play once again, and S then began moving 
the handle. The E remained in the observa- 
tion room until the task was over. For each 
S the number of lever-pressing responses was 
recorded separately for each minute of the 
pre- and postexperimental periods. 


RESULTS 


Preexperimental period —The num- 
ber of responses made during the first 
2-min., base-rate period of the task 
provides a measure of performance 
independent of the effects of visual 
reinforcers. The results of a 4 (Con- 
dition) X 2 (Sex) analysis of variance 
of these data showed a significant Sex 
effect, F (1, 104) = 449, p < Qs. 
which reflects the difference between 
the initial rate of lever pressing of 
boys (M = 110.71) and that of girls 
(M = 85.68). Neither the Condition 
effect nor Condition X Sex effect was 
significant (Fs < 1.00). 

"Dung the 2 min. following the 
base-rate period, visual reinforcers 
were made contingent on the lever- 
pressing response. A4 (Condition) 
x 2 (Sex) analysis of variance was 
performed on the number of responses 


made during this period to determine 


if differences existed between the per- 
formance of the groups. The signifi- 
cant Sex effect, F (1, 104) = 7.56, 
p < .01, indicated that the boy's rate 
of response (.M = 132.07) was higher 
than that of the girls (M = 96.84). 
None of the other effects was signifi- 
cant (Fs « 1.00). 

To determine whether there was an 
increase in rate of responding during 
the 2-min. period with reinforcers as 
compared to the base-rate period with 
no reinforcers, difference scores were 
obtained by subtracting the number 
of responses made during the base- 
rate period from the number made 
during the subsequent 2 min. That 
all groups had higher rates of response 
during the reinforcement period than 
during the base-rate period was evi- 
denced by the lack of any negative 
difference scores. An analysis of vari- 
ance of these scores yielded a signifi- 
cant Grand Mean effect, F (1, 104) 
= 51.31, p « .001, indicating that 
rate of response was significantly 
greater during the period with rein- 
forcers than during the base-rate 
period. In addition, the Sex effect 
was once again significant, F (1, 104) 
= 5.04, p < .05. The difference scores 
were larger for boys (7 = 21.36) than 
for girls (M — 11.16), indicating that 
boys showed a greater increment in 
response during the reinforcement 
period than girls. As was the case in 
the other two analyses, the Condition 
and interaction effects were not sig- 
nificant (Fs < 1.00). To determine 
whether the difference scores were 
related to the initial base-rate scores, 
correlations between these scores were 
performed separately for boys and 
girls. Neither the correlation coeffi- 
cient for boys (7 = —.05) nor that for 
girls (r = .15) was statistically signifi- 
cant, indicating that the increment in 
response made during the reinforce- 
ment period was independent of 
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performance 
period. 

Postexpcrimental period —Vhe score 
used in the analysis of the results of 
the postexperimental period was a 
difference score. This score was 
obtained by subtracting the mean 
number of responses made during each 
of the last 2 min. of the preexperi- 
mental period from the number of 
responses made during each minute 
of the postexperimental period. The 
Mean number of responses made dur- 
ing the last 2 min. of the preexperi- 
mental period by Ss in the four condi- 
tions were as follows: D10, 59.27; D2, 
53.54; S2, 56.25; S10, 59.86. Each of 
these values was obtained by averag- 
ing the mean number of responses 
made during each of the 2 min. The 
Mean number of responses made 
during the entire postexperimental 
Period by Ss in the four conditions 
Were as follows: D10, 340.30; D2, 
300.87 ; S2, 287.13; S10, 274.85. 

The difference score represents an 
Increase or decrease in rate of respond- 
Ing during the postexperimental period 
aS compared to rate of responding 
during the reinforcement period of the 
Preexperimental period. It provides 
@ measure of the influence that the 


during the base-rate 


TABLE 1 
ANALYSIS oF VARIANCE OF DIFFERENCE 
Scores ror PostEXPERIMENTAL 
PERIOD 
Source df MS F 
Conditio 43.9. 4.59** 
n (C 3 | 5243.93| 4. 
ep. 10 (6) Dep. 2 1 | 325.73| «1.00 
ep. 1 s. 1 |12825.7 x 
Sex (Gj OFZ ve Sate 1042) p TOL 126 
&XS 3| 1952.04] 1.70 
MIOr 104 | 1141.64 
Minutes (M) 4| 4163.39| 16.82*** 
§XM 12| 90.04] «1:00 
CXM 4| 179.63| <1.00 
R SX M 12| 104.22] <1.00 
FOE 416 | 247.51 
ota] 559 
A PE URN NN 
** 
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TABLE 2 


MEAN DIFFERENCE SCORES ror THE Four 
GROUPS ACROSS THE 5 Mix. 


experimental conditions had on the 
rate of responding. 

The mean difference scores for Ss in 
the four experimental groups were as 
follows: D10, 43.95; D2, 33.17; S2, 
5.88; 510, —24.45. By inspecting the 
relationship between these means it 
can be seen that rate of responding 
decreased as deprivation decreased 
and as satiation increased. These 
data were analyzed by means of a 4 
(Condition) X 2 (Sex) X 5 (Minutes) 
analysis of variance employing or- 
thogonal comparisons. Comparisons 
were made between the performance 
of the D10 and D2 groups, the S10 and 
S2 groups, and the combined D10, D2 
and combined S10, and S2 groups. 
The results of the analysis are sum- 
marized in Table 1. 

It can be seen in Table 1 that the 
Condition effect was significant, in- 
dicating that the groups differed in 
their overall rate of response. The 
significant D10, D2 vs. S10, S2 com- 
parison was based on the fact that the 
total of the combined difference scores 
of the deprivation groups was greater 
than that of the satiation groups. 

The only other significant effect 
was the Minutes effect. It can be seen 
in Table 2, which presents the mean 
difference scores for the four groups 
across the 5 min., that all groups 
showed increasing increments of re- 
sponse throughout the postexperi- 
mental period, 
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DISCUSSION 


The analysis of the base-rate scores 
indicated that the boys had a signifi- 
cantly higher rate of response than the 
girls. A possible explanation for this 
finding is that boys found the task more 
interesting than the girls due perhaps to 
parts of the apparatus that could be 
associated with space equipment. 

When rate of response for the pre- 
experimental period with reinforcers was 
compared with rate for the base-rate 
period, the difference scores were higher 
for boys than for girls; indicating that the 
introduction of visual reinforcers resulted 
in greater increments in response for boys 
than for girls. This finding lends support 
to the reliability of the results of earlier 
studies (Stevenson & Odom, 1961, in 
press) which showed visual stimuli to be 
more effective as reinforcers for boys 
than for girls. 

The lack of significant Condition and 
Condition X Sex effects in the three 
analyses of the data for the preexperi- 
mental period was anticipated, since con- 
ditions were identical for all groups 
during this period, and there was there- 
fore, no reason to expect differences in 
performance among the four groups. 

That the various conditions affected 
the rate of incremental responding, or, in 
other words, the effectiveness of the rein- 
forcers, was demonstrated by the differ- 
ences in rate of responding among the 
four groups. That the four groups 
demonstrated an increase in increment 
of response across minutes is assumed to 
be due, at least partially, to the ability 
of the visual reinforcers to maintain and 
strengthen the response regardless of the 
condition. 

In addition to the present study, 
several studies, cited earlier, have in- 
dicated that visual stimuli have rein- 
forcing characteristics that are effective 
in modifying children's behavior. In the 
present study Ss in the ae groups 
showed higher rates of responding during 
the preexperimental period when visual 
reinforcers were presented than ede. 
the base-rate period when no reinforcers 


ere presented. Behavior was further 
Wi 
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modified by the deprivation and satiation 
operations employed during the experi- 
mental period, so that rate of responding 
increased as a function of decreasing 
satiation and increasing deprivation. 

In light of these results it seems reason- 
able to speculate about the possibility of 
visual stimulus deprivation and satiation 
as a source of motivation. Throughout 
the task the amount of visual stimulation 
was reduced considerably from the 
amount normally available to Ss, with 
the greatest reduction occurring for the 
D10 group and the least reduction oc- 
curring for the S10 group. It is possible 
that certain processes within the organ- 
ism may adapt to a particular level of 
visual stimulation. When the amount of 
stimulation falls below this level, the 
organism may become generally respon- 
sive, with certain responses being rein- 
forced by visual stimuli which conse- 
quently increase the probability of the 
reoccurrence of that response. On the 
other hand, when such a nced state exists, 
it may activate specific responses that are 
designed to obtain specific stimuli. 

The results of the present study indi- 
cate that visual deprivation and satiation 
have certain behavioral effects that are 
much like those of similar operations 
using other classes of reinforcers, It is 
possible that a process similar to that 
occurring with deprivation and satiation 
operations using primary reinforcers also 
exists when sensory reinforcers are used. 
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al length bein 
ted by 
Experiment II, Ss served in 4 intertrial 
alone; (b) display of stimulus pattern; 
Tesponse correctness; or (d) both b 
were used for each condition: 1, 15, or 29 sec. 
did not worsen during 
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pt-identification problems under 
4 intertrial durations (1-25 sec.) ; 
l 5 irrelevant stimulus dimensions) ; 
lus patterns (self-paced vs. 
improved, then got worse with in- 
§ greater in more complex 
mode of stimulus control. 
conditions: (a) simple time-out 
(c) display of signal indicating 
and c. 3 intertrial durations 
"Trends were the same, 
longer intervals under Condi- 


order given, the following critical 
events: (a) the appearance of a stim- 
ulus pattern; (b) S's categorizing re- 
Sponse to it; and (c) the immediate 
withdrawal of the pattern and pres- 
entation of informative feedback (IF) 
that signals the correct response. 
Then a brief intertrial interval, often 
referred to as the postfeedback inter- 
val, intervenes before commencement 
of a new trial. 

Bourne and Bunderson (1963) found 
that increasing the length of the inter- 
trial intervals significantly reduced 
number of errors and trials to solution 
in identification problems. The effect, 
moreover, was related to task com- 
plexity, since greater facilitation was 
observed in problems complicated by 
larger numbers of irrelevant stimulus 
dimensions, ie., variations among 
stimulus patterns not related to 
problem solution. 


The restricted range of intertrial 
624 
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durations explored in that study, 1-9 
sec., was limited by a requirement that 
conditions overlap those of an earlier 
experiment (Bourne, 1957). Thus 
available information on the general 
form of the relationship between per- 
formance and the intertrial interval 
(and on their interaction with task 
complexity) is incomplete. Obvi- 
ously, the facilitation reported by 
Bourne and Bunderson must diminish 
beyond 9 sec., if only because of a 
ceiling effect. A more compelling 
reason to explore a greater range of 
times in concept-identification prob- 
lems is provided by the possibility 
that longer intertrial intervals may 
interfere with performance, an effect 
which has been observed in other 
learning tasks (¢.g-, Bilodeau & Bilo- 
deau, 1958). ‘There is, moreover, the 
question of the composition of the in- 
tertrial interval—the inclusion or ex- 
clusion of various, task-related sources 
of information may greatly affect per- 
formance. It was the purpose of these 
experiments to explore further the 
role of intertrial variables in concept 
identification. 


GENERAL METHOD 


, Task and apparatus.—A complete descrip- 
tion of procedural details has been given 
(Bourne, 1957). Briefly, a 16-mm. stripfilm 
Projector was used to present S with a series 
of stimulus patterns (geometric designs) each 
of which was a combination of the levels of 
two relevant plus one or more irrelevant 
binary stimulus dimensions, such as color 
(red-green) and form (square-triangle). To 
identify the category to which any particular 
pattern belonged, S pressed one of four, 
unlabeled response keys; the task was to 
discover the two relevant dimensions and how 
the four conjunctive combinations of their 
associated with the four keys. 


levels were 1 : 
S was immediately 


Following each response, 
informed of the correct category for the 
stimulus pattern by the onset of an IF light 
Over an appropriate key. The sequence of IF 
signals was controlled by à Western Union 
tape transmitter equipped with a preco! ed 
tape that matched the stimulus series. Asin 


625 


carlier studies, both levels of irrelevant 
dimensions appeared within the sequence of 
stimulus patterns but were uncorrelated with 
IF signals. Electronic timing units con- 
trolled duration of IF (1 in Exp. I and 
in control conditions of p. ID, and the 
length of the intertrial interval. The S’s 
responses were recorded on an Esterline- 
Angus operations recorder. All Ss were 
initially given detailed instructions about the 
task, the meaning of IF signals, and the 
criterion of problem solution—16 consecu- 
tively correct responses. 


EXPERIMENT | 


The major aim of Exp. I was to ex- 
plore, in concept-identification prob- 
lems characterized by different degrees 
of complexity, a more extensive range 
of intertrial interval than used in 
previous studies. lt had the second- 
ary purpose of gathering evidence on 
the relationship between S's activities 
during this interval, and another, the 
time from onset to termination of the 
stimulus pattern. Usually, the length 
of stimulus exposure has been deter- 
mined by S, terminating only when S 
The effect of vary- 
ing the intertrial interval, no matter 
how interpreted, must be somewhat 
independent of stimulus duration, for 
Ss apparently cannot or do not com- 
pensate for short intertrial durations 
by lengthening the stimulus interval 
(Bourne & Bunderson, 1963). It may 
be, however, that fixing the stimulus 
interval at some brief length would 
interfere with performance when the 
intertrial interval is also brief, thus 
implying an interaction between S's 
activities during the two periods. To 
assess this possibility, the plan of the 
present experiment included parallel 
conditions with E-determined (fixed) 
and S-determined (self-paced) stim- 
ulus intervals. 


made a response. 


Method 


Subjects —The 192 Ss participating in the 
experiment were volunteers from introductory 
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psychology courses. Each was assigned to 
one of 16 experimental conditions and served 
individually for one session. 

Design A 45272 factorial design 

combined four lengths of intertrial interval 
(1, 9, 17, or 25 sec.), two degrees of task 
complexity (problems with one or five 
irrelevant stimulus dimensions), and two 
stimulus interval conditions (self-paced, or 
fixed at 5 sec. on all trials). The Ss perform- 
ing on the fixed interval were instructed to 
respond while the stimulus was on the viewing 
screen and that failure to respond would be 
counted as an error; fewer than 1% of 
observed errors involved failure to respond. 
‘The 5-sec. fixed duration was selected because 
an informal pilot study revealed this to be the 
approximate mean length of .S-determined 
intervals on precriterion trials. Lengths of the 
intertrial interval were chosen after an in- 
spection of data from the same pilot study, 
which suggested the existence of an optimal 
duration of less than 25 sec. for problems with 
five irrelevant dimensions, 

Two problems were used for each level of 
complexity; the two were equivalent save for 
differing relevant dimensions, Each was used 
equally often in all 16 main conditions—to 
reduce the possibility of collusion among Ss 
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a joint function of length of intertrial 
bete and number of irrelevant stimulus 
dimensions. (Each plotted point represents 
the data of 24 Ss.) 
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and to increase the generality of conclusions 
warranted by the data. 


Results 

Errors to solution were reliably 
affected by three factors: number 
of irrelevant dimensions, F (1, 160) 


= 151.89; length of the intertrial 
interval, F (3, 160) = 54.16; and the 
interaction of these two variables, 
F (3, 160) = 10.13; all with p <01. 
The interaction of irrelevant dimen- 
sion and intertrial interval was further 
analyzed into orthogonal components 
of trend : both the linear and quadratic 
terms were reliable; 
(1, 160) = 18.53 and F 
(1, 160) = 10.08, b «.0l. The in- 
fluence of these main variables is 
summarized in Fig, 1. The function 
relating errors to length of the inter- 
trial interval has a minimum, the 
value of which increases as does the 
Overall difficulty of the task, with 
number of irrelevant dimensions. 

No other main effect or interaction 
contributed Significant variance to 
performance data as indexed by 
errors. Analysis of trials-to-solution 
data yielded the same outcome. 


P'irrel. x IntertrialiLin. 
Irrel, X Intertrial:Quad. 


Discussion 


The results of Exp. I essentially 
replicated an earlier finding (Bourne & 
Bunderson, 1963) that a moderate in- 
Crease in the length of the intertrial 
Interval (to 9 sec.) produced reliable 
improvement in overall performance on 
concept-identification problems. Fur- 
ther, the data indicate the lack of a 
substantia] interaction between S's ac- 
tivities during intertrial and stimulus 
intervals, for the Performance trends 
were the same when duration of the stim- 
ulus Was fixed or self-determined. 
mentioned by Bourne and Bunderson, 


there are several plausible interpretations 
of facilitation d 


Intervals; for ex 
Provide oppor 


ide tunity: to associate char- 
acteristics of 


the stimulus with the 
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signaled response category; to rehearse 
retained information; and/or to formu- 
late new hypotheses about solution. 
These possibilities, however, are incom- 
plete, since they predict only an asymp- 
tote and not the deleterious effect 
observed when intertrial intervals exceed 
9-17 sec. A second controlling process 
apparently determines the optimizing of 
performance, but available evidence fails 
to specify its source(s). 


T LI 


IZXPERIME 


In Exp. II, the intertrial interval 
not only was varied in length, but also 
incorporated the four possible condi- 
tions involving presence or absence of 
stimulus pattern and corresponding 
lF signal from the preceding trial. 
l'he purpose was to determine whether 
availability of either source of in- 
formation would counteract the inter- 
ference effect associated with longer 
intertrial intervals. Such a result 
would provide indirect evidence on 
the importance of memory for previ- 
ously given information as a deter- 
miner of rate of problem solving. 


Method 


Subjects —Vhe Ss were 144 undergraduate 
college students who volunteered for par- 
ticipation in the experiment. Seventy-two Ss 
were drawn from elementary psychology 
courses offered at the University of Colorado 
and 72 were enlisted from various colleges and 
universities in the Missouri-Kansas arca. 
Each S was assigned randomly to one of the 
12 main conditions, and served individually 
for a single session. . 

Design.—The experimental plan combined 
factorially three lengths of intertrial interval 
(1, 15, and 29 sec.) with the presence or 
absence (during the intertrial interval) of 
either or both the stimulus pattern and the 
"hus, the conventional condition 
xperiments, where 
is available 


IF signal. T 
for concept-identification e 
neither pattern nor IF signal 
during the intertrial interval, can be compared 
systematically with conditions wherein either 
or both remain on display. The lengths of 
intertrial intervals were selected to overlap 
the range of times used in Exp. I. Two 


N 
- 


p 


MEAN ERRORS 


O Stimulus absent, IF absent 
@ Stimulus absent, IF present 


[ D) Stimulus present, IF absent ] 
Wl Stimulus present, IF present 
al p "1 
- 4 
! 1 1} 
d —8À — 
1 is 29 


LENGTH OF INTERTRIAL INTERVAL, SECONDS 


Fic. 2. Mean errors to solution as a 
function of length of intertrial interval and 
the presence or absence of stimulus pattern 
and/or IF signal on any trial. (Each plotted 
point represents the data from 12 Ss. All 
conditions utilized five irrelevant stimulus 
dimensions.) 


equivalent problems (differing only in terms 
of the dimensions chosen to be relevant but 
with five irrelevant dimensions in. each case), 
were used equally often for Ss in all main 
conditions. The source of Ss was systemat- 
ically balanced, with equal Ns from the two 
locations serving in each condition. 


Results 


Analysis of variance on errors to 
solution revealed the effect of length of 
intertrial interval, F (2,96) = 30.78, 
p < .01, to be essentially the same as 
that observed in Exp. 1. Overall per- 
formance was best with the inter- 
mediate (15-sec.), and worst with the 
shortest (1-sec.) interval. There was, 
in addition, a general facilitative 
effect due to the availability of the 
stimulus pattern during the intertrial 
interval, F (1,96) = 4.35, p < .05. 
The presence or absence of the accom- 
panying IF signal was an ineffective 
variable. There were no reliable 
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interactions between the three main 
variables. However, the conditions 
wherein the stimulus pattern did 
or did not persist during a 29-sec. 
intertrial interval did differ reliably, 
F (1, 96) = 3.95, p< 05. Figure 2 
portrays errors to solution „as a 
function of the three main variables; 
performance did not degrade during 
the longest intertrial interval when the 
stimulus pattern of the preceding trial 
was continuously available. 

Performance on the two problems, 
in spite of their formal equivalence, 
was reliably different in the present 
study, F (1, 96) = 9.87, p < .01, and 
there is also a significant interaction 
between problems and the location 
from which Ss were recruited, F (1, 96) 
= 4.29, p < .05. Further analysis of 
this interaction demonstrated that the 
problem difference was considerably 
larger in the Kansas-Missouri than in 
the Colorado sample. 

Findings in an analysis of variance 
based on total trials to criterion were 
the same as those based on the error 
analysis, except that the presence or 
absence of the stimulus pattern during 
the intertrial interval failed to produce 
a reliable difference. 


Discussion 


The data of Exp. I and II clarify some 
of the relations between intertrial vari- 
ables and concept-identification perform- 
ance. Under conventional procedure 
(where neither stimulus pattern nor feed- 
back signal is availed S during the inter- 
trial interval), performance improved 
and then worsened as the intertrial in- 
terval lengthened from 1 through 29 sec. 
In Exp. I, the optimum of this curvilinear 
relation was found to shift asa function 
f problem complexity; on the easier of 
oe ks, optimal performance was asso- 
ns ith a much shorter interval. 
ement in performance produced 
te increases in intertrial dura- 


likely results from enhanced 


ciated wi 

Improv 
by modera 
tion very 
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opportunity for S to associate the stim- 
ulus pattern with the signaled response 
category, and/or make appropriate infer- 
ences about relevant or irrelevant aspects 
of the stimulus. The observed optimiz- 
ing and subsequent deterioration of per- 
formance under longer intertrial intervals 
strongly suggests the influence of a 
second controlling process—an interfer- 
ence effect that accumulates during and 
across intertrial intervals, eventually 
overcoming the gains wrought by moder- 
ateintertrial durations. In simpler prob- 
lems, with fewer irrelevant dimensions 
and less variable stimulus patterns, less 
benefit would derive from intertrial 
practice; interference effects, therefore, 
would more likely be manifest at shorter 
intertrial durations, 

While it is difficult to specify the 
source of such interference effects, loss of 
memory (for information provided by 
previously displayed pattern stimuli) is 
quite likely. The results of Exp. II 
demonstrated that performance did not 
deteriorate, even with the longest inter- 
trial interval used, when stimulus pat- 
terns remained available to S. Indeed, 
fewest mean errors occurred in the condi- 
tion where both stimulus and IF signal 
were on display throughout at 29 sec. 
intertrial interval. However, perform- 
ance differences accruing to presence or 
absence of IF signals per se were sta- 
tistically negligible. 

The data are open to several inter- 
pretations. It may be that Ss deprived 
of stimuli during intertrial intervals 
retain but a fraction of total information 
availed in previous intratrial displays. 
If forgotten aspects of stimuli are rele- 
vant, or if remembered aspects are 
irrelevant, and if either is improperly 
associated with a response category, 
there would certainly result a delay in 
Problem solution. Such an interpreta- 
tion is consistent with related findings of 
Cahill and Hovland (1960), and of 
Bourne, Goldstein, and Link (1964), 
which demonstrated that the majority of 
errors made in concept learning are due 
to utilization of hypotheses that are in- 
compatible with information provided by 
Previously seen but no longer available 
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stimuli. Incorrect hypotheses, thus, are 
largely attributable to S's failure to re- 
tain or retrieve information about earlier 
displays of pattern stimuli. 

Finally, the results of the present study 
may have been due in part to another 
controlling factor: loss of attention or 
vigilance to task. Prolonged intertrial 
intervals devoid of task-related stimuli 
might well conduce to such a loss. This 
and other possible sources of interfer- 
ence remain to be assessed in further 
experiment. 
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EFFECTS OF ORDERED AND CONSTANT SUCROSE 
CONCENTRATIONS ON NONREINFORCED 
PERFORMANCE ! 
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2 experiments were performed to determine the effects of different 
sucrose concentrations on nonrcinforced performance. In the Ist 
experiment, the concentrations (4%, 8%, 32%, 64%) were presented 


in either an increasing or decreasing order on alternate days 


all Ss 


experienced both orders. In the 2nd experiment, Ss received only 1 of 
the above concentrations. The results indicated that the response rate 
preceding extinction was not an adequate basis for predicting non- 
reinforced performance. An alternative view, which focused on the 
stimulus properties of the sucrose reinforcers as predictors of resistance 


to extinction, is presented. 


The present studies were designed 
to determine if nonreinforced perform- 
ance can be better predicted from the 
response rate immediately preceding 
extinction or from the value of the 
sucrose incentive preceding extinction. 
In his review article on magnitude of 
reinforcement, Pubols (1960) con- 
cluded, along with Metzger, Cotton, 
and Lewis (1957), that resistance to 
extinction is best predicted by the 
terminal response rate—the rate of 
responding immediately preceding ex- 
tinction. Others (Campbell, 1958; 
Guttman, 1953; Young & Green, 
1953) indicate that some dimension of 
the reinforcement properties—taste, 
receptor sensitivity, or a motivational 
property, is a more important vari- 
able. Marx, Tombaugh, Cole, and 
Dougherty (1963) trained two groups 
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of rats to bar press on either an in- 
creasing (4%, 8%, 32%, 64%) or 
decreasing (64%, 32%, 8%, 4%) 
order of sucrose incentives, with con- 
centrations progressively changed be- 
tween each of four consecutive 30-sec- 
nonreinforced test trials. Although 
Ss bar pressing on the increasing order 
had a reliably lower terminal response 
level than those on the decreasing 
order, they pressed more during the 
nonreinforced trials. These results 
suggest a motivational determination 
of resistance to extinction, and sup- 
port the interpretation that extinction 
is essentially a function of a motiva- 
tional decrement (Marx, 1958, 1960). 
That is, Ss which received progres- 
sively more desirable incentives (in- 
creasing order) were more motivated 
to respond during extinction than SS 
offered progressively less desirable 
incentives (decreasing order). 

TThe first experiment incorporated 
the same basic design except that 
every S was exposed to both incentive 
orders. Thus, on alternating days 
each S was given one of the two orders. 
The purpose of this daily alternation 
was to determine whether the previ- 
ously found differences between train- 
ing and extinction would still vary 
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systematically as a function of the 
incentive order, or if experience with 
both orders would eliminate these 
differences. That is, we attempted to 
determine whether the previously 
demonstrated behavior is a reversible 
phenomenon, as one would expect if 
some motivational aspect of the rein- 
forcement were assumed to be the 
controlling factor. The present de- 
sign has the important methodological 
advantage of permitting each S to 
serve as its own control. 


EXPERIMENT | 
Method 


Subjects—The Ss were 16 naive female 
albino rats purchased from the Sprague- 
Dawley Company, which were approximately 
90 days old at the beginning of the experiment. 

A pparatus.—The apparatus was the same 
as described by Marx et al. (1963). In brief, 
it consisted of eight operant-conditioning 
boxes in which each bar press (BP) mechan- 
ically operated a dipper located below the bar. 
The bar could be made inoperative by means 
of a mechanical lock. A light above the bar 
served as a discriminative cuc. 


Procedure 


_ Training.—Three days prior to the be- 
ginning of the experiment Ss were placed on 
23-hr. water deprivation. All Ss were given 5 
days of dipper training. On each day, water- 
loaded dippers were manually presented 20 
times, with the bar absent, at 60-sec. intervals. 
For the next 5 days Ss received BP training. 
For 20 min, they were placed in the boxes 
with the bar positioned and allowed to bar 
Press continuously for water. Following the 
termination of BP training, discrimination 
training began. On the first day of dis- 
Crimination training, the 20-min. BP session 
was broken down into eight 120-sec. response 
trials separated by seven 30-sec. nonresponse 
intervals, During response trials the bars 
Were operative and the discrimination light 
Was off. During nonresponse trials the bars 
Were inoperative and the light was on. Over 
the first 4 days the light-off, bar-operative 
time was gradually decreased from 120 sec. 
to 30 sec., while the light-on, bar-inoperative 
time was increased from 30 sec. to 120 sec. 
On the fifth and sixth days of discrimination 
training there were eight 30-sec. response trials 


separated by seven 120-sec. nonresponse 
trials. Following the sixth and last day of 
discrimination training Ss were placed on 
ad-lib water and 23-hr. food deprivation. 
Two days later the testing phase was initiated. 

Testing. —On the first day half of the Ss 
(Group A) received an increasing order of 
sucrose (4%, 8%, 32%, 64%) on the first four 
consecutive response trials. The other half 
(Group B) received a decreasing order (64%, 
32%, 8%, 4%). Reinforcement was con- 
tinuous. No reinforcement was given during 
the last four response trials. On Day 2, Ss 
which had previously received the increasing 
order, now received the decreasing order, and 
vice versa. During the 12 test days the orders 
were alternated daily. The number of BPs 
during each of the eight response trials was 
recorded. 


E 


Results 


Figure 1 shows the mean BPs for 
each of the incentive orders summed 
over days. The results were clearly as 
predicted, confirming the earlier re- 
sults of Marx et al. (1963). There 
was a lower terminal reinforced rate 
for the increasing order than for the 
decreasing order, but this relationship 
was reversed in the following nonrein- 
forced trials. 

An analysis of variance appropriate 
to a three-factor, related-measures 
design was performed on the data for 
the 12 test days. All Fs indicated 
reliable differences except the order 
variable summed over reinforcement 
and trials. The F of major interest 
was the Order X Reinforcement inter- 
action. This interaction was reliable, 
F (1,15) = 12.13, p « .005, indicat- 
ing a change in performance from 
reinforced to nonreinforced trials as a 
function of incentive order. À break- 
down of the analysis, using the appro- 
priate error terms from the major 
analysis, indicated that the total num- 
ber of BPs during reinforcement did 
not differ reliably for the two orders, 
F (1,15) = 1.70, p > .05, but the in- 
creasing order did produce reliably 
more BPs than the decreasing order 
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during nonreinforcement, F (1, 15) 
= 17.72, p « .001. Also, the inter- 
action between order and the last 
reinforced trial and first nonreinforced 
trial was reliable, B, 15) = 83.61, 
p < .001, indicating an immediate 
shift in bar-press rate. During the 
last reinforced trial Ss pressed less 
under increasing order than under 
decreasing order, but they Pressed 
nore in the first nonreinforced trial 
after the increasing order. PONY 
The BP scores on the last reinforced 
] first nonreinforced trial, averaged 
ae ll of the increasing and de- 
pum wa order sequence separately, 
rece morpared for each $5 des 
data, in which each S served as its ow 


Mean number of bar presses for 12 d 
of incentive order and trials, 


ays as a function 


control, offer convincing confirmation 
of the group differences: for the in- 
creasing incentive order, 15 of the 16 
Ss averaged more BPs on the first 
nonreinforced than on the final rein- 
forced trial, while for the decreasing 
incentive order the opposite result 
occurred for 14 of the 16 Ss. A Q test 
for related samples (Cochran, 1950) 
showed these differences to be reliable 
(5b « 001). 

A similar analysis considering only 
the first 2 test days revealed that the 
Same relationship between the last 
reinforced and first nonreinforced trial 
Was present from the beginning of test. 
On the increasing incentive order 15 
of the 16 Ss bar pressed more on the 
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first nonreinforced trial than on the 
last reinforced trial, while on the 
decreasing order the opposite result 
occurred for 9 of the 16 Ss. A Q test 
showed these differences to be reliable 
ib « 05). 

It was hypothesized that the results 
of the first study were primarily at- 
tributable to the terminal concentra- 
tion, independent. of ordering, and 
that the nonreinforced responding was 
related directly to sucrose concentra- 
tions. To test these assumptions the 
same design was again used, except 
that no order was incorporated. Four 
groups of Ss were used. Each group 
received one of the following concen- 
trations on all four of the reinforced 
trials: 4%, 8%, 32%, 64%. It was 


Meon BP 
o 


expected that the decreasing and in- 
creasing groups and the constant 4% 
and 64% groups, respectively, would 
demonstrate the same function for 
the terminal reinforced trial and the 
nonreinforced trials. The 8% and 
326 groups were included in order to 
provide more data, so that the rela- 
tionship between the terminal concen- 
tration. and resistance to extinction 
could be more adequately assessed. 


EXPERIMENT II 
Method 
Subjects and apparatus.—The Ss were 32 
naive female albino rats which were approxi- 


mately 120 days old at the beginning of the 
experiment and were purchased from the 
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Fic. 3. Mean number of bar presses during training and extinction 
as a function of sucrose concentrations. 


Sprague-Dawley Company. The apparatus 
was the same as was used in Exp. I. 


Procedure 


Training and fest—The same training 
procedure as used in Exp. I was incorporated. 
In test, the 32 Ss was divided into four groups 
designated on the basis of the different sucrose 
concentrations employed: 4%, 8%, 32%, 
64%. On each of the first four consecutive 
30-sec. response trials the groups received the 
same sucrose concentrations. The Ss were 
not reinforced during the last four consecutive 
response trials. The Ss were tested 6 days a 
week for 4 wk. 


Results 

The number of BPs per S for each 
response trial was summed over the 
first 12 days. Group means are shown 
in Fig. 2. An analysis appropriate to 


a three-factor experiment in which 
two factors are repeated measures was 
performed on these data. The vari- 
ables under consideration were con- 
centration, reinforcement, and trials. 
All differences were reliable, but the 
main interest was the Concentration 
X Reinforcement interaction, F (3,28) 
= 10.37, p < .005. This analysis re- 
flected the highest rate of reinforced 
performance by the 8% group, fol- 
lowed by the 4%, the 32%, and the 
64% groups, respectively, while during 
nonreinforcement, BP rate was di- 
rectly related to concentration. Also; 
the interaction between trials and 
concentration was reliable for the 
reinforced trials, F (9, 80) = 3.66; 
P < .005. The interaction was due to 
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the stable performance over trials by 
the 49%, 805, and 32% groups as op- 
posed to a large decrement in respond- 
ing by the 6465 group. 

Figure 3 shows the mean perform- 
ance as a function of sucrose concen- 
tration, collapsed over trials. The 
group differences during reinforce- 
ment were reliable, F (3, 28) = 28.69, 
p < .005, as were the differences 
during nonreinforcement, F (3, 28) 
=10.70, p < .005. It should be noted 
that the differences during nonrein- 
forcement were a linear function of 
the sucrose concentration. 

An analysis based on the same 
information for Wk. 3 and 4 showed 
a continuation of the reinforced be- 
havior, but no reliable differences 
during nonreinforcement were present, 
due to a general lack of responding for 


all groups. 


Discussion 


Although Pubols (1960) has indicated 
that terminal response rate (rate of rein- 
forced responding immediately preceding 
extinction) predicts resistance to extinc- 


tion, the present experiments question 
the generality of such a view. The in- 
factor is 


adequacy of the response-rate 
clearly illustrated in both experiments by 
the existence of an interaction between 
the last reinforced trial and the first non- 
reinforced trial. In Exp. I, when all the 
nonreinforced trials were considered, 
lower terminal response rates were asso- 
ciated with reliably more bar pressing 
than higher terminal response rate. This 
Same trend was also present in Exp. II. 
Thus a lower terminal reinforced response 
rate may, under certain conditions, be 
followed by a greater number of non- 
reinforced BPs. 

The present results 
critical factor influenc 
extinction is the terminal sucrose concen- 
tration used as the incentive. Several 
facts support this view. In Exp. I, the 
interaction between the last reinforced 
and first nonreinforced trials was present 


suggest that the 
ing resistance to 
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sven though received different incen- 
E E ating days. This 

suggests the pervasive influence of the 
TY guise dss each eeina 

ra d č arious concen- 
trations, irrespective of the order (cf. 
Points 4%, 8%, and 32% concentration, 
Fig. 1), were remarkably consistent when 
considered over the 12 test days. This 
fact suggests that the incentive shift had 
relatively little influence upon the BP 
rate for the different concentrations, and 
that the terminal concentration (rather 
than the ordering of the incentives) pro- 
duced the interaction effects and extinc- 
tion differences. Experiment II, which 
was designed to test the above hypoth- 
esis, appears to confirm it if one compares 
the 4% and 64% groups on the last 
reinforced trial and the nonreinforced 
trials. These groups have essentially the 
same relationship as the decreasing and 
increasing groups in Exp. I and the 
Marx et al. study. Thus it may be 
concluded that the terminal level of 
sucrose is an important, if not the most 
important, determinant of the interaction 
and extinction differences. 

Figure 3 strikingly demonstrates the 

relationship between the terminal con- 
centration and resistance to extinction. 


Here, there exists a direct linear function 
oncentration and 


between the sucrose € 
the number of nonreinforced BPs. It 
may be that the sucrose concentration 


is a good indicator of resistance to ex- 
tinction, in that the greater the concen- 
tration the greater the resistance to 
extinction, providing the concentrations 
are discriminable, i.e., logarithmically 
scaled. However, this is merely specula- 
tion on our part, and before any such 
generalization is made further experimen- 
tation with other designs is necessary. 
The results of both experiments indi- 
cate the necessity of incorporating some 
type of nonassociative explanation to 
account for the differential persistence of 
nonreinforced responding. Any of the 
several current motivation, frustration, 
or cognitive theories could be applied in 
interpreting the data. With respect to 
the present experiments, all of these 
theories would assume that the groups 


536 TOM N. TOMBAUGH AND MELVIN H. MARX 


receiving the higher concentrations at the 
end of training would be more highly 
motivated (or have greater expectations) 
than those receiving the lower concen- 
trations. However, greater motivation 
is not readily apparent in the present 
experiments, since the number of BPs 
on the last reinforced period is inversely 
related to concentration, with the pos- 
sible exception of the 4% group in 
Exp. II. 

The typical explanation for the failure 
of BP rate to reflect motivation is 
physiological satiation, which should 
produce a between-trial BP decrement. 
Since the only appreciable decrement in 
response rate is for the 64% group, 
physiological satiation does not explain 
all of the results. A second explanation, 
which applies only to experiments in 
which continuous reinforcement is used, 
is that consummatory behavior con- 
stitutes an interfering response relative 
to the BP response. There is evidence 
that the duration of consummatory 
activity is a direct function of increased 
sucrose concentration. Smith and Duffy 
(1957) demonstrated that with high con- 
centrations of sucrose, Ss had more 
persistent drinking behavior than with 
lower concentrations. Unpublished data 
from our laboratory support this finding 
for all solutions of 895 or above. Con- 
sequently, the low response rate for high 
concentrations may be an indirect effect 
resulting from satiation and perseverative 
licking behavior. Studies which have 
minimized the effects of satiation and 
perseverative licking through the use of 
noncontinuous reinforcement have found 
a direct relationship between motivation 
and sucrose concentration (Collier & 
Siskel, 1959; Guttman, 1953, 1954; 
Young & Green, 1953; Young & Shuford, 
1954). In the present study, with con- 
tinuous reinforcement, the fact that this 
relationship is not found in the reinforced 
BP rate is because of the overlaying 
effects of physiological satiation and 


licking behavior. 
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Competing Responses and the Partial-Reinforcement fect: Donald F. McCoy, Jr.* and 
Melvin H. Marx: Department of Psychology, University of Missouri, Columbia, Missouri 
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Partial Visual Feedback of Component Motions as a Function of Difficulty of Motor Control: 
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Preference for a Positive Evaluative Response in Concept Learning: Ramon J. Rhine*: De- 
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Conditioning Acceptance or Rejection of Information: B. R. Bugelski* 
Department of Psychology, Townsend Hall, 1 
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